Preface



Quantum mechanics and quantum field theory are highly successful physical theories
that have numerous practical applications. Largely mathematical in character,
these theories continue to stimulate the imaginations of applied mathematicians
and purists as well. In recent years, in particular, as a new array of tools has
emerged, including a representative amount from the domain of so-called pure
mathematics, interest in both the conceptual and physical aspects of these beautiful
subjects has especially blossomed. Given the emergence of newer and often
spectacular applications of mathematics to quantum theory, and to theoretical
physics in general, one notes that certain communication gaps between physicists
and mathematicians continue to be bridged.

This text on quantum mechanics, designed primarily for mathematics students
and researchers, is an attempt to bridge further gaps. Although the mathematical
style presented is generally precise, it is counterbalanced at some points by a
relaxation of precision, as our overall purpose is to capture the basic flavor of the
subject both formally and intuitively. The approach is one in which we attempt to
maintain sensitivity with respect to diverse backgrounds of the readers, including
those with modest backgrounds in physics. Thus we have included several concrete
computational examples to fortify stated principles, several appendices, and
certain basic physical concepts that help to provide for a reasonably self-contained
account of the material, especially in the first 11 chapters.

Part I, which consists of the first ten chapters, is an introduction to the basic
classical themes and ideas of quantum mechanics. Part II, Chapters 11-18, consists
of selected special topics with some emphasis on path integrals and applications
of zeta functions and the Selberg trace formula. The material is somewhat
more technical in contrast to that of Part I. The last chapter is a further discussion
of special topics by guest contributor Patrick Shanahan.

Further remarks concerning the content and scope of this work now follow. 
In Chapter 1 we provide a brief overview of the physical aspects of the subject,
with a historic discussion of some basic themes: Planck’s quantization, Bohr’s
frequency rule, Schrödinger wave functions, de Broglie wavelength, Heisenberg’s
uncertainty principle, emission spectra, Balmer’s formula, and so on. These themes
set the tone of the work and are subject to further analysis in Chapters
3, 5, 6, and 7. A discussion of matters such as spin, Pauli wave functions,
Pauli’s exclusion principle, and the Zeeman effect, for example, is deferred to
Chapters 9 and 10.

Chapter 2 covers background material on classical mechanics, which is needed
for the transition to quantum mechanics. One could employ symplectic machinery
here in a modern approach, but we have chosen a more modest route, one
suitable for our purposes. No effort is made, moreover, to consider geometric
quantization, say in the context of a line bundle with a connection whose curvature
is the symplectic form on a base manifold. Material from Chapter 2 is
needed also for Chapters 8 and 18 where the motion of a charged particle in an
electromagnetic field is considered.

A study of hydrogen-like atoms occupies most of Chapter 5. The discussion
there is extended in Chapter 7 which takes up spectroscopic selection rules for
such atoms. Since these rules follow from a general abstract selection rule, which
in fact is a pure group-theoretic result, we develop the group theory necessary for
a proof of the latter. The statement presented clarifies some of those found in the
physical literature which seem to rest on hypotheses not as well articulated. We
also discuss, among other things, the group invariance of Schrödinger’s equation,
and we provide some remarks on representations of the symmetric group for purposes
of Chapter 10. It is observed, for example, that the latter group has exactly
two irreducible representations of degree 1, up to equivalence, a mathematical
fact which can be viewed as corresponding in some sense to the somewhat mysterious
fact that nature allows only for symmetric or antisymmetric wave functions
in the description of indistinguishable particles. More extended and systematic
applications of group theory are found in other excellent texts, particularly in the
three references listed at the conclusion of Chapter 7.

The Schrödinger equation in the context of gauge theory is considered in Chapter 18.
That chapter, contributed by Pat Shanahan, takes as a starting point material
from Chapter 8. An introduction to gauge theory is presented, with a discussion
of the Maxwell and Yang-Mills equations.

Appendix E in the general appendices at the end of the text is presented to help 
ease the transition from Part I to Part II, by providing a few minor comments on 
quantum field theory and statistics. Some of the material in Part II requires of the 
reader some elementary knowledge of Riemannian geometry. 

One of the special features of this book, not found in physics texts, is the
systematic use of the theory of hypergeometric equations, by A. Nikiforov and
V. Uvarov, to develop uniformly the theory of special functions and to solve
Schrödinger equations without the usual recourse to power series methods. By
this approach, one obtains a priori quantization of energy conditions from a single
point of view. This theory, developed in the fourth chapter, has many applications
apart from quantum theory. We also indicate in Part II the important role, which
is ever increasing, that automorphic forms play in physical theories. In particular,
we consider some critical applications of Selberg’s trace formula.

Part I



Introductory Concepts in 
Quantum Theory 




0 

Units of Measurement 



This book is largely targeted toward a mathematical audience, whose grasp and
recollection of principles of physics may vary from small to great. It seems prudent
therefore that we should adopt a somewhat self-contained approach and proceed
along elementary lines. In many cases we shall attempt to complement stated
principles with concrete computational examples. Towards this end, one necessity
is a review of some of the basic units of measurement. Also towards the end of
providing a largely self-contained account we will supply generous amounts of
detail at times in the discussion of much of the material.

It will be convenient to mention terms like “energy,” or “radiation,” in this 
chapter and the next without providing some precision of meaning. Key terms 
and notions will be defined and developed as we move along. Some readers may 
choose to browse lightly through the remainder of this chapter and to return to it 
later as necessity dictates. 

Classical physics involves the notion of fundamental quantities such as length,
time, mass, force, and energy. Various systems have been devised for measuring
these quantities. Length (the distance between two points) can be measured in
miles, feet, or meters, for example, the conversions being that 1 mile = 5,280
feet and 1 meter = 3.28 feet. Time (a measure of duration) will usually be expressed
in seconds. Mass is a measure of inertia, i.e., of the resistance an object
offers to a change in its motion as some force is applied to it. The notions of mass
and force are not independent ones as we may regard force, defined intuitively, as
a push or pull which tends to produce change of motion (or position) of an object.
If one takes the notion of force as fundamental, for example, then one can define
mass via Newton’s second law of motion. Namely, the acceleration $a$ experienced
by an object (i.e., the rate of change of its velocity) due to a force $F$ acting on it
is directly proportional to $F$: $a = kF$ for some $k > 0$. Then we can define the
mass $m$ of the object by $m = \frac{1}{k}$; thus $F = ma$. Energy is a measure of the capacity
for work to be done, work being the product of the magnitude of force and the
displacement it causes. Energy may be potential (due to position) or kinetic (due
to motion). A more careful discussion of these matters will be given in Chapter 2.

In the English system mass is measured in pounds and force is measured in
poundals: one poundal of force is that amount which will impart to a mass of one
pound an acceleration of one foot per second in one second. We shall be interested
in the French system where mass is measured in grams and force is measured
in dynes: one dyne of force is that amount which will impart to a mass of one
gram an acceleration of one centimeter per second in one second. Here a mass of
one pound = 453.6 grams, a centimeter $\overset{\text{def}}{=}$ meter/100 = 0.0328 feet = 0.3936
inches. Therefore one poundal of force = 1 pound of mass × 1 ft/sec$^2$ = 453.6
grams × $\frac{1}{0.0328}$ cm/sec$^2$ = 13829.27 $\frac{\text{g cm}}{\text{sec}^2}$ (where we also write “g” for grams, “cm”
for centimeters) $\Rightarrow$ 1 dyne = $\frac{1}{13829.27}$ poundal. One also refers to the c.g.s.
system where length, mass, time are measured in centimeters, grams, seconds.
For example, work (= force × distance) in the c.g.s. system is measured in
dyne-centimeters. One dyne-centimeter is called an erg (= 1 $\frac{\text{g cm}^2}{\text{sec}^2}$).
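
The poundal-to-dyne conversion above can be checked mechanically. The following is a minimal sketch that uses only the two conversion factors quoted in the text; the function and variable names are our own and are introduced purely for illustration.

```python
# Conversion factors quoted in the text (rounded values).
GRAMS_PER_POUND = 453.6   # 1 pound of mass in grams
FEET_PER_CM = 0.0328      # 1 centimeter in feet


def poundal_in_dynes() -> float:
    """One poundal = 1 lb x 1 ft/sec^2, expressed in g cm/sec^2 (dynes)."""
    cm_per_foot = 1.0 / FEET_PER_CM
    return GRAMS_PER_POUND * cm_per_foot


dynes = poundal_in_dynes()
print(f"1 poundal = {dynes:.2f} dynes")
print(f"1 dyne    = {1.0 / dynes:.4e} poundal")
```

Because the inputs are rounded, the result is only as accurate as the quoted factors 453.6 and 0.0328.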

Energy can be measured in erg units. For example, the kinetic energy $T$ of an
object of mass $m$ with velocity of magnitude $v$ is given by $T = \frac{1}{2}mv^2$; we shall
see in Chapter 2 why the $\frac{1}{2}$ appears in this definition. Thus in the c.g.s. system the
units of $T$ are grams × $\frac{\text{cm}^2}{\text{sec}^2}$ = ergs. One can also measure calories in terms of
ergs: 1 calorie (the amount of heat required to raise the temperature of one gram
of water one degree centigrade (or Celsius)) = $4.187 \times 10^{7}$ ergs.

The unit of action (= work × time) is the erg-second. The fundamental constant
that we consider is Planck’s constant $h$ given by

$$h = 6.62608 \times 10^{-27}\ \text{erg-seconds}. \tag{0.0.1}$$

We also write $\hbar$ for $h/2\pi$ (following Dirac):

$$\hbar = \frac{h}{2\pi} = 1.05457 \times 10^{-27}\ \text{erg-seconds}. \tag{0.0.2}$$

There will be occasions to consider periodic motion, for example the motion
of a fixed point on the circumference of a circle. In this case the frequency $\nu$ of
the motion is the number of revolutions per second of the point, and the period
$T$ is the amount of time required for one revolution; thus $\nu = \frac{1}{T}$ has the unit
(second)$^{-1}$. The unit of frequency is the hertz (Hz), in honor of Heinrich Hertz
who discovered radio waves in 1887. One hertz = (second)$^{-1}$ or one cycle per
second. $h$ × frequency is therefore measured in ergs, and therefore should represent
some energy quantity. This indeed is the case, as expressed by the Bohr
frequency rule, discussed in Chapter 1, Section 1.2. Also note formula (1.1.1).

Recall next that two electrically charged objects attract or repel each other
according as the charges are opposite or like, respectively. In the c.g.s. Gaussian
system charge is measured in electrostatic units (esu), defined as follows: One
esu is that amount of charge on a test object for which an equal object of like
charge is repelled by a force of 1 dyne if the two objects are placed 1 cm apart in
a vacuum. Experiments in 1909 by Ehrenhaft and in 1910 by Millikan especially
have determined that the charge $-e$ on the electron is given by

$$e = 4.80321 \times 10^{-10}\ \text{esu}. \tag{0.0.3}$$



The esu is also called a stat-Coulomb. One has Coulomb’s law (due to Charles
Coulomb (1736-1806))

$$F = \kappa\,\frac{q_1 q_2}{r^2}, \tag{0.0.4}$$

which we shall discuss further in Chapter 2, for the magnitude $F$ of the force of
attraction or repulsion of two objects with charges $q_1, q_2$ separated a distance $r$
apart; $\kappa > 0$ is a suitable proportionality constant.



Example

Two objects of like charge experience a repulsive force of 9 dynes. They are
separated a distance of 0.20 cm apart. What is the modulus $|q|$ of the charge $q$ of
these objects?

Solution: By Coulomb’s law $\kappa q^2 = r^2 F = (0.20)^2 \cdot 9\ \text{cm}^2\,\frac{\text{g cm}}{\text{sec}^2} = 0.36\ \frac{\text{g cm}^3}{\text{sec}^2} \Rightarrow
|q| = \frac{0.6}{\sqrt{\kappa}}$ esu.
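
The solution above can be reproduced numerically. The sketch below solves Coulomb's law (0.0.4) for the common charge of two equal objects; the function name is hypothetical, and the default $\kappa = 1$ is our assumption, chosen because the definition of the esu (unit charges 1 cm apart repel with 1 dyne) makes the constant equal to 1 in the Gaussian system.

```python
import math


def charge_modulus(force_dynes: float, r_cm: float, kappa: float = 1.0) -> float:
    """Solve F = kappa * q^2 / r^2 for |q| when both objects carry the same charge.

    kappa = 1.0 is an assumption matching the esu definition in the text.
    """
    if force_dynes < 0 or r_cm <= 0 or kappa <= 0:
        raise ValueError("force must be non-negative; distance and kappa positive")
    return math.sqrt(force_dynes * r_cm**2 / kappa)


q = charge_modulus(force_dynes=9.0, r_cm=0.20)
print(f"|q| = {q:.2f} esu")
```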

In addition to the electron charge $-e$ given in (0.0.3) we shall need the electron,
proton, and neutron masses $m_e$, $m_p$, $m_n$:

$$m_e = 9.10939 \times 10^{-28}\ \text{grams}, \tag{0.0.5}$$

$$m_p = 1.67263 \times 10^{-24}\ \text{grams} \tag{0.0.6}$$

$$m_p = 1836.16\, m_e, \tag{0.0.7}$$

$$m_n = 1.67493 \times 10^{-24}\ \text{grams} \tag{0.0.8}$$

$$m_n = 1838.69\, m_e. \tag{0.0.9}$$



The hydrogen atom consists of an electron (with negative charge $-e$) and a proton
with positive charge $e$. The reduced mass $\mu$ of hydrogen is defined by $\mu = \frac{m_e m_p}{m_e + m_p}$.
Thus by (0.0.5) and (0.0.7),

$$\mu = \frac{1836.16\, m_e^2}{1837.16\, m_e} = 0.999\, m_e = 9.1044 \times 10^{-28}\ \text{grams}. \tag{0.0.10}$$



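Given this value of $\mu$ we can compute Bohr’s radius $a_0$ defined by

$$a_0 \overset{\text{def}}{=} \frac{h^2}{4\pi^2 \mu e^2}. \tag{0.0.11}$$

First note indeed that $a_0$ has the unit of length: By Coulomb’s law (0.0.4) $e^2$ is
measured in units (cm)$^2$ dynes = $\frac{\text{g cm}^3}{\text{sec}^2}$. Then the quotient in (0.0.11) has units

$$\frac{\left(\frac{\text{g cm}^2}{\text{sec}^2}\,\text{sec}\right)^2}{\text{g} \cdot \frac{\text{g cm}^3}{\text{sec}^2}} = \frac{\text{g}^2\,\text{cm}^4\,\text{sec}^{-2}}{\text{g}^2\,\text{cm}^3\,\text{sec}^{-2}} = \text{centimeters}.$$

By equations (0.0.1), (0.0.3) and (0.0.10),

$$a_0 = \frac{43.904936 \times 10^{-54}}{8292.3234 \times 10^{-48}}\ \text{cm} = 0.52946 \times 10^{-8}\ \text{cm}. \tag{0.0.12}$$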
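
As a check on (0.0.10) and (0.0.12), the reduced mass and the Bohr radius can be recomputed from the constants quoted earlier. This is a minimal sketch; the function and constant names are our own, and the result agrees with the text only up to the rounding of the input constants.

```python
import math

# Constants in c.g.s. Gaussian units, as quoted in (0.0.1), (0.0.3), (0.0.5), (0.0.7).
H = 6.62608e-27         # Planck's constant, erg-sec
E_CHARGE = 4.80321e-10  # magnitude of the electron charge, esu
M_E = 9.10939e-28       # electron mass, grams
M_P = 1836.16 * M_E     # proton mass, grams


def reduced_mass(m1: float, m2: float) -> float:
    """Reduced mass m1*m2/(m1 + m2) of a two-body system."""
    return m1 * m2 / (m1 + m2)


def bohr_radius(mu: float) -> float:
    """Bohr radius h^2 / (4 pi^2 mu e^2) in centimeters, following (0.0.11)."""
    return H**2 / (4.0 * math.pi**2 * mu * E_CHARGE**2)


mu = reduced_mass(M_E, M_P)
a0 = bohr_radius(mu)
print(f"mu  = {mu:.4e} grams")
print(f"a_0 = {a0:.5e} cm = {a0 / 1e-8:.3f} Angstrom")
```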



As we shall show in Chapter 5, contrary to Niels Bohr’s earlier version of the
hydrogen atom which proposed that the electron assumes a circular orbit about
its nucleus, the electron (in its lowest energy state) maintains an average distance
of $\frac{3}{2}a_0$ cm from the nucleus. We will calculate, moreover, that the probability of
finding the electron within a sphere about the origin of radius $a_0$ is 0.3235. When
working with small atomic distances it is convenient at times to use Angstrom
units (Å): 1 Å = $10^{-8}$ cm. Thus we may write $a_0 = 0.529$ Å.

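The quantity

$$\mu_B \overset{\text{def}}{=} \frac{e\hbar}{2 m_e c}, \tag{0.0.13}$$

where

$$c = 2.99792458 \times 10^{10}\ \frac{\text{cm}}{\text{sec}} \tag{0.0.14}$$

is the speed of light in a vacuum, will be encountered in Chapter 9. $\mu_B$ is called
the Bohr magneton. By (0.0.2), (0.0.3), (0.0.5), and (0.0.14),

$$\mu_B = 9.27401 \times 10^{-21}\ \frac{\text{esu-sec}^2 \times \text{erg}}{\text{gm} \times \text{cm}}. \tag{0.0.15}$$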



We have enough information on hand to compute, for the record, the famous fine
structure constant $\alpha \overset{\text{def}}{=} e^2/\hbar c$, a dimensionless quantity which controls minute
spin and magnetic effects of electrons in atoms. By (0.0.2), (0.0.3), (0.0.14), and
the remarks following (0.0.11),

$$\alpha = \frac{23.070826 \times 10^{-20}\ \text{cm}^2\ \text{dynes}}{3.1615212 \times 10^{-17}\ \text{erg-sec cm sec}^{-1}} = 7.29738 \times 10^{-3} = \frac{1}{137.036}. \tag{0.0.16}$$
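
The values (0.0.15) and (0.0.16) follow directly from the constants listed so far. The sketch below recomputes both in c.g.s. Gaussian units; the function names are our own, and agreement with the text is limited by the rounding of the quoted constants.

```python
# Constants in c.g.s. Gaussian units, as quoted in (0.0.2), (0.0.3), (0.0.5), (0.0.14).
HBAR = 1.05457e-27      # h / (2 pi), erg-sec
E_CHARGE = 4.80321e-10  # magnitude of the electron charge, esu
M_E = 9.10939e-28       # electron mass, grams
C = 2.99792458e10       # speed of light, cm/sec


def bohr_magneton() -> float:
    """Bohr magneton e*hbar / (2 m_e c), following (0.0.13)."""
    return E_CHARGE * HBAR / (2.0 * M_E * C)


def fine_structure_constant() -> float:
    """Dimensionless fine structure constant e^2 / (hbar c)."""
    return E_CHARGE**2 / (HBAR * C)


mu_b = bohr_magneton()
alpha = fine_structure_constant()
print(f"mu_B    = {mu_b:.5e}")
print(f"alpha   = {alpha:.5e}")
print(f"1/alpha = {1.0 / alpha:.3f}")
```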



For a particle of charge $q$ and mass $m$ the quantity $\omega = \frac{qB}{mc}$ will appear in
Chapters 8 and 9, where $B$ is a real number representing a certain magnetic field
strength. The latter has units $\frac{\text{erg}}{\text{esu-cm}}$. Thus $\omega$ has units

$$\frac{\text{esu}}{\text{g}} \cdot \frac{\text{erg}}{\text{esu cm}} \cdot \frac{\text{sec}}{\text{cm}} = \frac{\text{esu erg sec}}{\text{g esu cm}^2} = \frac{\text{esu sec g cm}^2}{\text{g cm}^2\ \text{esu sec}^2} = \frac{1}{\text{sec}}.$$

Indeed $\omega/2\pi$ will be the frequency of a harmonic oscillator. A common measure
of magnetic field strength is the Tesla, after Nikola Tesla (1856-1943), a
leading figure in the development of alternating current. The latter measure, however,
is based on SI (Système International) units where, for example, length,
mass, and force are measured in meters, kilograms, and Newtons, respectively.


1

Quantum Mechanics: Some Remarks
and Themes



1.1 Planck Quantization

Quantum mechanics to a large extent is a subject which conceptually is abstract
and very mathematical in nature. Despite its abstract formalism it has proved to
be a storehouse of ideas of enormous practical value. These radically new ideas,
boldly proposed and strikingly uncommon, have helped to unlock deep secrets
of the world, especially at the microscopic level where the breakdown of classical
physics becomes rather apparent. There flows indeed from this subject a vast
stream of concrete, everyday life applications. Transistors, television,
semiconductors, computers, and lasers, to name a few, are instruments of undebated
importance for communication, education, energy requirements, medicine,
etc., and all involve quantum mechanical effects. The work of P. Lenard and especially
of A. Einstein on the photoelectric effect, for example, helped toward the
development of television. Our world has been positively changed by the fruits of
this elegant subject and its forward thinking founders.

The present chapter provides casual, sometimes heuristic and incomplete
introductory remarks on some of the themes of these founders, remarks which
thus serve also as a perspective for this book. We introduce, for example, Planck’s
quantization, Bohr’s frequency rule, Schrödinger’s equation, de Broglie’s wavelength,
and Heisenberg’s uncertainty principle. These themes are subject to further
detailed analysis and development (in some cases) in ensuing chapters.

December 14th, 1900 is considered to be the birthday of quantum mechanics.
On that day Max Karl Ernst Ludwig Planck presented to the German Physical
Society a paper entitled “On the Theory of the Energy Distribution Law of the
Normal Spectrum.” It is appropriate to formally begin this book with some remarks
on Planck’s quantization, his radical idea on which the aforementioned
paper was based.

The first sentence in Chapter 2 of Hermann Weyl’s wonderful, scholarly book,
The Theory of Groups and Quantum Mechanics [87], is well worth quoting here:
“The magic formula

$$E = h\nu \tag{1.1.1}$$

from which the whole of quantum theory is developed, establishes a universal
relationship between the frequency $\nu$ of an oscillatory process and the energy $E$
associated with such a process.” Here $h$ is a universal constant of nature discovered
in October, 1900 by Planck. Planck was led to formula (1.1.1) in his study
of radiation emitted by a solid hot body. The problem was to find theoretically
a formula for the energy density of the radiation as a function of its frequency
$\nu$. Such a function provides for a spectral emission curve. Such curves were obtained
from experiments prior to 1900, but physicists were unable to account for
them on the basis of the classical theory of electrodynamics and statistical mechanics.
After some desperate attempts, Planck found that a satisfactory solution
of the problem could be obtained if one made the following radical assumption:
the energy of emitted (or absorbed) radiation cannot take on an arbitrary value but
only integral multiples $0, h\nu, 2h\nu, 3h\nu, \ldots$ of the basic energy value $E = h\nu$. Thus
the energy is not arbitrary and “continuous” in value, but is quantized (restricted).
This quantization of energy would be further looked into by Einstein.

Five years later Einstein would explain that a beam of light was like a hail
of particles, called quanta. Later the name photon (due to G. N. Lewis (1926))
would come into vogue. Einstein’s proposal was that all electromagnetic radiation
of frequency $\nu$ (compare Table 1.1 in the next section) consists of discrete
particle-like “bundles” or “packets,” each of energy $h\nu$. If an electromagnetic
wave consists of 3000 such packets (photons), for example, then it has an energy
of $3000h\nu$ units. The idea of quantization of energy has of course profound
consequences concerning the structure of matter.

Example

Suppose a photon has a frequency of $6.1 \times 10^{12}$ Hz. Using the value $h =
6.626 \times 10^{-27}$ erg-sec (see equation (0.0.1)) we see that the photon has energy
$E = h\nu = 6.626 \times 10^{-27}\ \text{erg-sec} \times 6.1 \times 10^{12}\ \text{sec}^{-1} = 4.042 \times 10^{-14}$ ergs. We will
discuss the relationship between the frequency $\nu$ of a wave and its wavelength $\lambda$
in Section 1.4 of this chapter. As we shall see $\nu\lambda$ = the speed of the wave. By
this fact, we have for a photon $\nu\lambda = c$ where $c$ is the speed of light in a vacuum.
By (0.0.14)

$$c = 2.99792458 \times 10^{10}\ \text{cm/sec}. \tag{1.1.2}$$

The wavelength of the preceding photon is $\lambda = \frac{c}{\nu} = \frac{3 \times 10^{10}\ \text{cm/sec}}{6.1 \times 10^{12}\ \text{sec}^{-1}} = 0.492 \times 10^{-2}$
cm, which is in the infrared range; compare Table 1.1 in the next section. Consider
a second photon with wavelength $\lambda = 5 \times 10^{-9}$ cm in the x-ray range. We claim
that this x-ray photon is a million times more energetic than the infrared photon.
Its energy is $E = h\nu = \frac{hc}{\lambda} = \frac{6.626 \times 10^{-27} \times 3 \times 10^{10}}{5 \times 10^{-9}}$ ergs $= 3.976 \times 10^{-8}$ ergs $=
0.98 \times 10^{6}\,(4.042 \times 10^{-14})$ ergs, which verifies the claim.
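
The arithmetic of this example is easy to replay. The following sketch uses the same rounded values of $h$ and $c$ as the example; the function names are our own and are introduced only for illustration.

```python
H = 6.626e-27  # Planck's constant in erg-sec, rounded as in the example
C = 3.0e10     # speed of light in cm/sec, rounded as in the example


def photon_energy_from_frequency(nu_hz: float) -> float:
    """Planck's relation E = h * nu, in ergs."""
    return H * nu_hz


def wavelength_from_frequency(nu_hz: float) -> float:
    """Wavelength lambda = c / nu of a photon, in centimeters."""
    return C / nu_hz


def photon_energy_from_wavelength(lam_cm: float) -> float:
    """E = h * c / lambda, in ergs."""
    return H * C / lam_cm


e_infrared = photon_energy_from_frequency(6.1e12)
lam_infrared = wavelength_from_frequency(6.1e12)
e_xray = photon_energy_from_wavelength(5.0e-9)
ratio = e_xray / e_infrared

print(f"infrared photon: E = {e_infrared:.3e} ergs, lambda = {lam_infrared:.3e} cm")
print(f"x-ray photon:    E = {e_xray:.3e} ergs")
print(f"energy ratio:    {ratio:.3e}")
```

The ratio comes out just under $10^6$, which is the sense in which the x-ray photon is "a million times" more energetic.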



1.2 The Bohr Frequency Rule and Emission Spectra

In 1913, Niels Bohr, inspired by Planck’s quantization, formulated a frequency
rule which is described as follows. Suppose an atom (or some quantum system)
undergoes a transition from one state of energy $E$ to a second state of energy $E'$.
Then radiation will be emitted or absorbed. If $E > E'$ (i.e., the system proceeds
from a state of higher energy to a state of lower energy) then radiation will be
emitted. The frequency $\nu$ of the emitted radiation will be given by $E - E' = h\nu$
(which compares with Planck’s equation (1.1.1)). Similarly, if the system passes
from a lower energy state $E$ to a higher energy state $E'$ ($E < E'$), then it will
absorb radiation of a frequency $\nu$ given by $E' - E = h\nu$. Thus, in general

$$|E - E'| = h\nu. \tag{1.2.1}$$

The frequency rule plays a decisive role in the theoretical derivation of formulas
for frequencies (or wavelengths) observed in certain spectroscopic lines, in
particular in the derivation of Balmer’s formula. This application is one of the
beautiful, spectacular achievements of the Bohr theory. Before considering such
formulas and launching directly into a general discussion of spectroscopic lines,
it will be useful to recall first the elementary notion of the spectrum of ordinary
light.

During the latter part of the 17th century Christian Huygens and Isaac Newton 
conducted major, initial investigations into the nature of light. In his celebrated 
prism experiment Newton demonstrated that ordinary “white light” was really a 
mixture of light of various colors ranging from red to violet. Namely, suppose 
a beam of light (sunlight, for example) is allowed through a narrow slit to enter 
a dark room and pass through a prism as indicated in Figure 1.1. As the beam 
strikes the prism it is refracted (i.e., it is bent due to a change in its speed). The 
refracted light leaving the prism is allowed to strike a screen in the dark room. 
Observing the screen, one will note a band of colors (red, orange, yellow, green, 
blue, indigo, violet) formed thereon, called the spectrum of visible light. 

The term dispersion refers to this spreading out or separation of white light
into its constituent colors. Each color corresponds to a certain wavelength $\lambda$, or
frequency $\nu = \frac{c}{\lambda}$ (as we have noted earlier), with $\lambda \approx 4 \times 10^{-5}$ cm for violet
(with short wavelength and high frequency) and $\lambda \approx 7.5 \times 10^{-5}$ cm for red (with
long wavelength and low frequency).

[Figure 1.1: Newton’s prism experiment. Labels: narrow slit, dark room.]

The spectrum of light is part of the general electromagnetic spectrum of a range
of waves with wavelengths greater than or less than those of visible light, given
in Table 1.1.

In other words, light is simply electromagnetic radiation in a certain frequency
range detectable by the human eye (as discovered by James Maxwell and Heinrich
Hertz).

The values given in Table 1.1 are approximate and they vary from one text to
another.

Newton’s prism experiment concerning the spectrum of light eventually prompted
the study of emission spectra of various atoms and molecules. The latter study
provides deep insight into atomic and molecular structure, and provides in particular
experimental proof that only certain specific energy states can be realized.
Just as each of us as individuals has a distinguished set of identifying fingerprints,
atoms, molecules, elements, etc. have a distinguished emission spectrum, which
might be continuous, or might consist of lines. Similar to Figure 1.1 one has the
following typical set-up in diagram form for observing a line spectrum, say of a
luminous gas for example.

In the discharge tube (containing the gas) electrons flowing from the negative
electrode to the positive electrode collide with the gas. From these collisions there
results the emission of light or radiation by the atoms of the gas. As in Figure 1.1
this light when passed through a prism and allowed to collect on a plate appears
as a set of lines, colored images corresponding to some definite wavelength. The
fact that these lines are discrete (each is separated by a particular distance)
corresponds to the fact that the atoms in their excited state emit their radiation only
quantum mechanically, i.e., at a discrete set of specific wavelengths. In this
example no continuous distribution of wavelengths is observed.

Table 1.1: Electromagnetic spectrum

                           wavelength $\lambda$ in cm                        frequency $\nu = c/\lambda$ in Hz

long radio waves           $10^{7}$ to $10^{9}$                              $3 \times 10^{3}$ to $30$
AM radio waves             $10^{3}$ to $10^{4}$                              $3 \times 10^{7}$ to $3 \times 10^{6}$
FM radio and TV waves      $1.5 \times 10^{2}$ to $7.5 \times 10^{2}$        $2 \times 10^{8}$ to $4 \times 10^{7}$
short radio waves          $10^{-1}$ to $1$                                  $3 \times 10^{11}$ to $3 \times 10^{10}$
microwaves                 $10^{-2}$ to $10^{-1}$                            $3 \times 10^{12}$ to $3 \times 10^{11}$
infrared waves             $10^{-3}$ to $10^{-2}$                            $3 \times 10^{13}$ to $3 \times 10^{12}$
visible light              $7.5 \times 10^{-5}$ to $4 \times 10^{-5}$        $4 \times 10^{14}$ to $7.5 \times 10^{14}$
ultraviolet rays           $10^{-7}$ to $10^{-4}$                            $3 \times 10^{17}$ to $3 \times 10^{14}$
x-rays                     $10^{-10}$ to $10^{-6}$                           $3 \times 10^{20}$ to $3 \times 10^{16}$
gamma rays                 $10^{-13}$ to $10^{-8}$                           $3 \times 10^{23}$ to $3 \times 10^{18}$



In the important case of hydrogen, for example, for each integer $n \ge 1$ there
is a corresponding (infinite) set or series of spectral lines $\{L_j^{(n)}\}_{j=n+1}^{\infty}$. For $n =
1, 2, 3, 4, 5$ for example we obtain the following five series, each named after its
discoverer: the Lyman series $\{L_j^{(1)}\}_{j=2}^{\infty}$, the Balmer series $\{L_j^{(2)}\}_{j=3}^{\infty}$, the Paschen
series $\{L_j^{(3)}\}_{j=4}^{\infty}$, the Brackett series $\{L_j^{(4)}\}_{j=5}^{\infty}$, and the Pfund series $\{L_j^{(5)}\}_{j=6}^{\infty}$,
respectively. For the Balmer series an illustration of its spectrum (in part) is given
in Figure 1.3. Recall that 1 Angstrom = $10^{-8}$ cm.

In 1885 Johann Balmer offered a simple formula to account for the empirical
wavelength values appearing in Figure 1.3. His formula was

$$\lambda = K\,\frac{j^2}{j^2 - 2^2}, \qquad j = 3, 4, 5, \ldots \tag{1.2.2}$$

for the wavelength $\lambda$ of the line $L_j^{(2)}$, where $K$ is some constant. On the basis of
experimental evidence he deduced that by taking $\frac{K}{4} = 911.76$ Å formula (1.2.2)
found agreement with the known data. Thus

$$\lambda = 911.76 \left(\frac{4j^2}{j^2 - 4}\right)\ \text{Å}. \tag{1.2.3}$$

Take $j = 8$ for example. Then formula (1.2.3) gives $\lambda = 911.76\left(\frac{256}{60}\right)$ Å $=
3890.176$ Å for the wavelength corresponding to the line $L_8^{(2)}$, which is the value
given in Figure 1.3. If we let $j \to \infty$ in equation (1.2.3) we obtain the wavelength
$\lambda_\infty = 3647.04$ Å which the limiting line in Figure 1.3 corresponds to.
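
Balmer's empirical rule is simple enough to tabulate directly. The sketch below evaluates the formula with $K/4 = 911.76$ Å for the first few lines and for the series limit; the function name and the choice of which lines to print are our own.

```python
K_OVER_4 = 911.76  # Balmer's empirical constant K/4, in Angstroms


def balmer_wavelength(j: int) -> float:
    """Wavelength in Angstroms of the j-th Balmer line: (K/4) * 4 j^2 / (j^2 - 4)."""
    if j < 3:
        raise ValueError("the Balmer series starts at j = 3")
    return K_OVER_4 * 4.0 * j * j / (j * j - 4)


# Series limit: as j grows, 4 j^2 / (j^2 - 4) tends to 4.
series_limit = 4.0 * K_OVER_4

for j in range(3, 11):
    print(f"j = {j:2d}: lambda = {balmer_wavelength(j):8.1f} Angstrom")
print(f"limit : lambda = {series_limit:8.2f} Angstrom")
```

Rounded to one decimal place, the printed values can be compared line by line with the wavelengths recorded in Figure 1.3.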

[Figure 1.3: Photographic plate for the Balmer series (experimental observation).
Wavelengths $\lambda$ in Angstroms, running from the infrared side to the ultraviolet side.]

    $L_3^{(2)}$         6564.7    red
    $L_4^{(2)}$         4862.7    blue-green
    $L_5^{(2)}$         4341.7    blue
    $L_6^{(2)}$         4102.9    violet
    $L_7^{(2)}$         3971.2
    $L_8^{(2)}$         3890.2
    $L_9^{(2)}$         3836.5
    $L_{10}^{(2)}$      3799.0
    $L_\infty^{(2)}$    (limiting line)

Although Balmer’s formula provides an accurate description of the wavelengths
occurring in his series in Figure 1.3, the formula does not provide an explanation
of spectral discreteness. A theoretical derivation of Balmer’s formula, and of the
more general Ritz-Rydberg type formulas for wavelengths in other series, is possible
as we shall see now. In Chapter 5 it will be shown in detail that the solution of
Schrödinger’s equation (discussed in the next section) leads to the energy states of
hydrogen, which are given by a discrete sequence $\{E^{(n)}\}_{n=1}^{\infty}$. Thus Schrödinger’s
equation carries within itself (and supersedes) Planck’s quantization, and it carries
in particular (in conjunction with the Bohr frequency rule) the explanation of
spectral discreteness. Namely, $E^{(n)}$ is given by

$$E^{(n)} = -\frac{2\pi^2 \mu e^4}{h^2 n^2}, \tag{1.2.4}$$

where $-e$ is the electron charge and $\mu$ is the reduced mass of hydrogen; see (0.0.3),
(0.0.10). Define

$$R \overset{\text{def}}{=} \frac{2\pi^2 \mu e^4}{h^3 c} \tag{1.2.5}$$

so that by (1.2.4)

$$E^{(n)} = -\frac{hcR}{n^2}. \tag{1.2.6}$$



Here $c$ is the speed of light; see (1.1.2). Equation (1.2.6) can also be obtained
by the Bohr theory. $R$ is called the Rydberg constant (after the Swedish physicist
Johannes Rydberg). Now by the Bohr frequency rule, a transition $E^{(n)} \to E^{(n')}$
from one energy state to another leads to radiation of frequency $\nu$ given by
equation (1.2.1):

$$h\nu = \left|E^{(n)} - E^{(n')}\right| = hcR\left|\frac{1}{n^2} - \frac{1}{(n')^2}\right|. \tag{1.2.7}$$

If $n_2 > n_1$ (i.e., $E^{(n_2)} > E^{(n_1)}$), then in the transition $E^{(n_2)} \to E^{(n_1)}$ radiation will
be emitted (as we have seen), and since the frequency $\nu$ and wavelength $\lambda$ of this
radiation are related by $c = \lambda\nu$ we may express equation (1.2.7) as

$$\frac{1}{\lambda} = R\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right), \tag{1.2.8}$$

which gives Balmer’s formula (1.2.2): take $n_1 = 2$. More generally (1.2.8) is the
Ritz-Rydberg formula. The experimental value of $R$ is

$$R_e = 109678\ \text{cm}^{-1} = 0.00109678\ (\text{Å})^{-1}. \tag{1.2.9}$$

The theoretical value $R_t$ of $R$ is given by definition (1.2.5). By (0.0.1), (0.0.3),
(0.0.10), (1.1.2)

$$R_t = 109677.78\ \text{cm}^{-1} = 0.0010967778\ (\text{Å})^{-1}, \tag{1.2.10}$$

which agrees with (1.2.9). Here recall that the units of $e^2$ are gram $\frac{\text{cm}^3}{\text{sec}^2}$ (by
Coulomb’s law) and therefore the units of $e^4$ are gram$^2$ $\frac{\text{cm}^6}{\text{sec}^4}$, which implies that
the units of $R$ are indeed cm$^{-1}$. Note moreover that $\frac{1}{R_t} = 911.74$ Å so that
$R_t$ is in close agreement with Balmer value $\frac{K}{4} = 911.76$ Å given earlier; i.e.,
$R_t \approx \frac{4}{K}$.
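
The theoretical Rydberg constant can be recomputed from definition (1.2.5) and the constants of Chapter 0. The sketch below is our own illustration (the function name is hypothetical); since the inputs are rounded to five or six significant figures, the output should be expected to match (1.2.9) and (1.2.10) only to within about one part in $10^5$.

```python
import math

# Constants in c.g.s. Gaussian units, as quoted in (0.0.1), (0.0.3), (0.0.10), (1.1.2).
H = 6.62608e-27         # Planck's constant, erg-sec
E_CHARGE = 4.80321e-10  # magnitude of the electron charge, esu
MU = 9.1044e-28         # reduced mass of hydrogen, grams
C = 2.99792458e10       # speed of light, cm/sec


def rydberg_constant() -> float:
    """Rydberg constant 2 pi^2 mu e^4 / (h^3 c) in cm^-1, following (1.2.5)."""
    return 2.0 * math.pi**2 * MU * E_CHARGE**4 / (H**3 * C)


r_theory = rydberg_constant()
print(f"R = {r_theory:.1f} cm^-1")
```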

The lines in Figure 1.3 which we have labeled $L_3^{(2)}, L_4^{(2)}, L_5^{(2)}, \ldots$ are usually
labeled $H_\alpha, H_\beta, H_\gamma, \ldots$ in spectroscopy. We now see that for the Balmer series
(where $n_1 = 2$ in (1.2.8)) the red line $L_3^{(2)} = H_\alpha$ corresponds to the transition
$E^{(3)} \to E^{(2)}$, from a higher energy state to a lower, in which radiation of wavelength
$\lambda$ is emitted, where $\lambda$ is given theoretically by equation (1.2.8), where we use
$\frac{1}{R} = 911.74$ Å: $\lambda = 911.74\left(\frac{1}{4} - \frac{1}{9}\right)^{-1}$ Å $= 6564.53$ Å, which compares with
the value $\lambda = 6564.7$ Å of Figure 1.3. Similarly the blue-green line $L_4^{(2)} = H_\beta$
corresponds to the transition $E^{(4)} \to E^{(2)}$, the blue line $L_5^{(2)} = H_\gamma$ corresponds
to the transition $E^{(5)} \to E^{(2)}$, etc. The wavelengths of the emitted radiation are
given (theoretically) by $\lambda = 911.74\left(\frac{1}{4} - \frac{1}{16}\right)^{-1}$ Å, $911.74\left(\frac{1}{4} - \frac{1}{25}\right)^{-1}$ Å, $\ldots$ $=
4862.61$ Å, $4341.62$ Å, $\ldots$ (for the transitions $E^{(4)} \to E^{(2)}$, $E^{(5)} \to E^{(2)}$, $\ldots$)
respectively, which differ just slightly from the values given in Figure 1.3. Further
computational examples for other spectral series will be given later. As we
have indicated a careful derivation of formula (1.2.4) (on which the preceding
analysis was based) will be given in Chapter 5. A “theoretical” account of the
Bohr frequency rule will appear in Chapter 7; see remarks in (i) following
equation (7.7.58).
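
The three theoretical Balmer wavelengths just quoted can be reproduced from the Ritz-Rydberg formula (1.2.8). The following is a minimal sketch that takes the value $1/R = 911.74$ Å used in the text as given; the function name is our own.

```python
INV_R = 911.74  # the value of 1/R in Angstroms used in the text


def ritz_rydberg_wavelength(n1: int, n2: int) -> float:
    """Wavelength in Angstroms for the transition E^(n2) -> E^(n1), n2 > n1 >= 1.

    Implements 1/lambda = R (1/n1^2 - 1/n2^2), following (1.2.8).
    """
    if not 1 <= n1 < n2:
        raise ValueError("need 1 <= n1 < n2 for an emission line")
    return INV_R / (1.0 / n1**2 - 1.0 / n2**2)


# Balmer series: n1 = 2, upper levels n2 = 3, 4, 5.
for n2 in (3, 4, 5):
    lam = ritz_rydberg_wavelength(2, n2)
    print(f"E^({n2}) -> E^(2): lambda = {lam:.2f} Angstrom")
```

Other series are obtained by changing `n1`, for example `n1 = 1` for the Lyman series.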

1.3 Schrödinger’s Equation

Bohr’s theory was remarkably accurate in accounting for the spectrum of hydrogen,
as we have noted. For atoms with two electrons however, such as helium,
predictions of the theory conflicted with experimental results. Corrections to and
generalizations of the theory were provided by the gifted teacher Arnold Sommerfeld,
about 1916-1919. Despite this, these earlier versions of quantum mechanics,
with their limited applicability and lack of a firm conceptual foundation, would
have to give way to a more substantial and comprehensive theory. Credits would
be due to great theorists such as L. de Broglie, E. Schrödinger, W. Heisenberg,
M. Born, P. Jordan, P. Dirac, W. Pauli, and to others.

In 1924 de Broglie assigned a wavelength $\lambda$ to an electron (or to an arbitrary
particle) moving with a speed $v$. Thus he associated a wave character or behavior
to a particle of mass $m$, a character which Einstein had noted in his work (on the
photoelectric effect, for example). De Broglie’s precise mathematical formula was

$$\lambda = \frac{h}{mv} = \frac{h}{p}, \tag{1.3.1}$$

where $p = mv$ is the momentum of the particle. In the next section we will discuss
de Broglie’s bold discovery in more detail.

The task of describing mathematically a de Broglie matter wave (or of setting up a wave function $\psi$ for a particle) was taken up by Erwin Schrödinger in a series of classic papers [75, 76] in 1926, wherein he announced the renowned wave equation, which in one dimension reads

$$\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + (E - V(x))\psi = 0. \tag{1.3.2}$$

Here $E \in \mathbb{R}$ is a fixed real number (which will represent the quantum energy of the particle), and $V(x)$ is the potential energy of the particle; potential and kinetic energy will be defined carefully in the next chapter. Schrödinger’s equation (in one dimension or in higher dimensions) is one of the most important equations of modern physics. It is the key equation that we will consider, of course. Schrödinger showed that his new wave mechanics was equivalent to the matrix mechanics version of quantum theory by Heisenberg, Born, and Jordan, and moreover that his equation, in his words, “carries within itself the quantum conditions”: conditions noted by Bohr and Sommerfeld, which were also discovered by W. Wilson and J. Ishiwara in 1915; compare Section 3.7, Chapter 3; in particular see equation (3.7.8) there. Namely, generally speaking, energies of a quantum system cannot assume arbitrarily prescribed values, but are given by a discrete set of eigenvalues $E$ of a second-order differential operator $H$, say

$$H = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + M_V \tag{1.3.3}$$

in the one-dimensional case, where $M_V$ is multiplication by the function $V(x)$, since equation (1.3.2) can be written as

$$H\psi = E\psi, \tag{1.3.4}$$

where one requires a non-zero square-integrable solution $\psi$ over $\mathbb{R}$:

$$\|\psi\|^2 = \int_{\mathbb{R}} |\psi(x)|^2\,dx < \infty \tag{1.3.5}$$

for $dx$ = Lebesgue measure on $\mathbb{R}$. From a normalized wave function $\psi$ (i.e., $\psi$ such that $\|\psi\|^2 = 1$) one extracts the following physical information, Max Born’s profound probabilistic interpretation (1926):

$$\int_a^b |\psi(x)|^2\,dx \ \text{ is the probability that the particle will be located within the interval } [a, b]. \tag{1.3.6}$$

Or in three dimensions, for example,

$$\iiint_{\mathcal{R}} |\psi(x, y, z)|^2\,dx\,dy\,dz \ \text{ is the probability of finding the particle in a region } \mathcal{R} \text{ in space} \tag{1.3.7}$$

(compare, for example, Section 5.5 of Chapter 5), where equation (1.3.2) is replaced by the equation

$$\frac{\hbar^2}{2m}\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right) + (E - V(x, y, z))\psi(x, y, z) = 0. \tag{1.3.8}$$



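Equations (1.3.2), (1.3.8) are time independent Schrödinger equations, compared to the more general time dependent Schrödinger equations

$$i\hbar\frac{\partial\psi}{\partial t}(x, t) = -\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial x^2}(x, t) + V(x)\psi(x, t) \overset{\text{def}}{=} H\psi(x, t), \tag{1.3.9}$$

$$i\hbar\frac{\partial\psi}{\partial t}(x, y, z, t) = -\frac{\hbar^2}{2m}\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right)(x, y, z, t) + V(x, y, z)\psi(x, y, z, t). \tag{1.3.10}$$

If $\psi(x)$ is a solution of the time independent equation (1.3.2), for some (energy level) $E$, then we easily obtain a time dependent solution $\psi(x, t)$ of (1.3.9) by setting

$$\psi(x, t) \overset{\text{def}}{=} \psi(x)\exp(Et/i\hbar). \tag{1.3.11}$$

Similarly, a solution $\psi(x, y, z)$ of (1.3.8) leads immediately to a solution

$$\psi(x, y, z, t) \overset{\text{def}}{=} \psi(x, y, z)\exp(Et/i\hbar) \tag{1.3.12}$$

of equation (1.3.10). Thus we may focus on the time independent equation.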
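The claim that the product of a time independent solution with $\exp(Et/i\hbar)$ solves the time dependent equation can be verified in two lines. The following is a sketch of that check in the one-dimensional case, assuming only that $\psi(x)$ satisfies $H\psi = E\psi$ and that $H$ acts on the $x$-variable alone:

```latex
\begin{align*}
i\hbar\,\frac{\partial}{\partial t}\Bigl[\psi(x)\,e^{Et/i\hbar}\Bigr]
  &= i\hbar\cdot\frac{E}{i\hbar}\,\psi(x)\,e^{Et/i\hbar}
   = E\,\psi(x)\,e^{Et/i\hbar},\\
H\Bigl[\psi(x)\,e^{Et/i\hbar}\Bigr]
  &= (H\psi)(x)\,e^{Et/i\hbar}
   = E\,\psi(x)\,e^{Et/i\hbar}.
\end{align*}
```

Both sides agree, so the product is a solution of the time dependent equation; the three-dimensional case is identical with $\psi(x, y, z)$ in place of $\psi(x)$.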

There is a heuristic procedure for obtaining the right-hand side of the time dependent equations (1.3.9), (1.3.10). For example we obtain the differential operator $H$ in (1.3.3) (or in (1.3.9)) by employing the “quantization rules” $p = mv \to \frac{\hbar}{i}\frac{d}{dx}$, $V \to M_V$, by which the operators $\frac{\hbar}{i}\frac{d}{dx}$, $M_V$ correspond to the “classical observables” $p$, $V$. Here $p^2 \to \left(\frac{\hbar}{i}\frac{d}{dx}\right)^2 = -\hbar^2\frac{d^2}{dx^2} \implies$ kinetic energy $T = \frac{1}{2}mv^2 = \frac{p^2}{2m} \to -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} \implies$ energy $T + V$, or Hamiltonian, $\to -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + M_V = H$, as desired (which is the reason for the symbol $H$). Note that the operators $D_x \overset{\text{def}}{=} \frac{\hbar}{i}\frac{d}{dx}$, $M_x$ satisfy the Heisenberg-Born-Jordan commutation relations

$$[D_x, M_x] \overset{\text{def}}{=} D_x \circ M_x - M_x \circ D_x = \frac{\hbar}{i}\,1. \tag{1.3.13}$$

In later chapters a good deal more will be said about Schrödinger’s equation and solutions of it in particular cases.
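The commutation relation can be checked directly on a differentiable test function. The following short calculation is a sketch under the reading $D_x = \frac{\hbar}{i}\frac{d}{dx}$ and $M_x$ = multiplication by $x$; the test function $f$ is introduced here only for illustration.

```latex
\begin{align*}
(D_x \circ M_x)f(x) &= \frac{\hbar}{i}\,\frac{d}{dx}\bigl(x f(x)\bigr)
                     = \frac{\hbar}{i}\,f(x) + x\,\frac{\hbar}{i}\,f'(x),\\
(M_x \circ D_x)f(x) &= x\,\frac{\hbar}{i}\,f'(x),\\
[D_x, M_x]f(x)      &= \frac{\hbar}{i}\,f(x).
\end{align*}
```

The product rule supplies the extra term $\frac{\hbar}{i}f$, which is exactly what survives in the commutator.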

1.4 The de Broglie Wavelength

In his famous 1924 Paris doctoral thesis, Louis Victor de Broglie boldly proclaimed that to a particle of mass $m$ moving with a velocity $v$ is associated a wavelength $\lambda$ given by $\lambda = \frac{h}{mv}$. Strictly speaking this equation holds for non-relativistic particles, where $v$ is much less than the velocity of light. $h$ is expressed in units of erg·sec = g·cm²/sec, so that indeed $\frac{h}{mv}$ is a length quantity. Notice that from time to time we will, carelessly, fail to distinguish between velocity $v$ and speed, the magnitude $|v|$ of $v$. Thus $\lambda = \frac{h}{mv}$ really means $\lambda = \frac{h}{m|v|}$. De Broglie’s claim was verified by experiments of C. Davisson and L. Germer, and by G. Thomson, O. Stern, F. Rupp and others. These interesting experiments demonstrated for example that beams of electrons could indeed be diffracted. Interference phenomena in such beams were also noted. Even before the birth of quantum mechanics Einstein noted wave-like properties of atoms in a monatomic gas. The wave-like character of a moving particle is now very well established. We will indicate one route by which one might arrive at the fundamental assertion $\lambda = \frac{h}{mv}$. For this we pause for a moment to understand the notion of a traveling wave and that of a wavelength.

Consider first a point $P$ which travels along the circumference of a circle $\Gamma$, uniformly in a counterclockwise direction with angular speed $\omega$ measured in radians per second. Call $R$ the radius of $\Gamma$, and call $\theta = \theta(t)$ the angular position of $P$ at time $t$ as indicated in Figure 1.4.

$\omega = \frac{d\theta}{dt} \implies \theta(t) = t\omega + \theta(0)$, which implies that the vertical displacement $y$ is given by $y = R\sin\theta = R\sin(t\omega + \theta(0))$. $y$ is plotted as a function of $t$ on the right side of the diagram in Figure 1.4. The amount of time required for $P$ to complete one revolution about the center of $\Gamma$ is the period $T$ of the motion: For any instant $t$, $\theta(t + T) = \theta(t) + 2\pi$; i.e., $(t + T)\omega + \theta(0) = t\omega + \theta(0) + 2\pi \implies T = \frac{2\pi}{\omega}$. The frequency $\nu$ of the motion is the number of revolutions per second: $\nu = \frac{1}{T} \implies y = R\sin(t\omega + \theta(0)) = R\sin\left(\frac{2\pi t}{T} + \theta(0)\right) = R\sin(2\pi\nu t + \theta(0))$, which describes the variation of the displacement $y$ with respect to time $t$. Note that $\nu$ need not be a whole number.

For the concept of a traveling wave one considers, however, displacement with respect to time and distance. Thus given a fixed constant $C$ (which will have a physical meaning presently) consider the function $f$ of two variables $t, x$ given by

$$f(t, x) = y\left(t - \frac{x}{C}\right) := R\sin\left[2\pi\nu\left(t - \frac{x}{C}\right) + \theta(0)\right]. \tag{1.4.1}$$

One checks immediately that $f$ is a solution of the wave equation

$$\frac{\partial^2 u}{\partial t^2} = C^2\frac{\partial^2 u}{\partial x^2}. \tag{1.4.2}$$

The plots of $x \to f(0, x)$, $f(t, x)$ (for $t$ fixed) represent, intuitively, a wave that has traveled to the right a distance $x_0' - x_0$ in $t$ seconds, the dashed curve in Figure 1.5 being $x \to f(t, x)$. The speed of the wave is given by the distance traveled divided by the time. Now $x_0 = \frac{C\theta(0)}{2\pi\nu}$ and $x_0' = x_0 + Ct \implies x_0' - x_0 = Ct \implies$ the speed of the wave is the constant $C$. The solid curve ($x \to f(0, x)$) has a maximum at the indicated points $m_1, m_2, m_3$, where $m_1 = x_0 - \frac{C}{4\nu}$, $m_2 = \frac{3C}{4\nu} + x_0$.

By definition, the wavelength $\lambda$ is the distance between two successive crests:

$$\lambda \overset{\text{def}}{=} m_2 - m_1 = \frac{C}{\nu}, \tag{1.4.3}$$

so that

$$C = \lambda\nu \tag{1.4.4}$$

and

$$f(t, x) = R\sin\left[-\left(\frac{2\pi x}{\lambda} - \omega t\right) + \theta(0)\right] \tag{1.4.5}$$

in equation (1.4.1).

Figure 1.5: Traveling wave

Thus, abstractly, we regard a traveling wave as a function $f$ of two variables (“time and position”) given by

$$f(t, x) = A\sin(kx \mp \omega t + \delta) \tag{1.4.6}$$

$$\phantom{f(t, x)} = A\cos(kx \mp \omega t + \delta_1) \quad \text{for } \delta_1 = \delta - \frac{\pi}{2} \tag{1.4.7}$$

for real constants $A, k, \omega, \delta$, called the harmonic amplitude, wave number (or propagation constant), angular speed (or angular frequency), and phase constant, respectively, of $f$. $f$ “moves to the right or left” according to the choice of $-$ or $+$ respectively in (1.4.6). The frequency $\nu$ of $f$ is $\frac{\omega}{2\pi}$, the wavelength $\lambda$ is given by $k = \frac{2\pi}{\lambda}$, and the speed or phase velocity $C$ of $f$ is given by $C = \nu\lambda = \frac{\omega}{k}$; compare (1.4.3), (1.4.4), (1.4.5). $f$ satisfies the wave equation (1.4.2). Using the identity $\sin(a \mp b) = \sin a\cos b \mp \cos a\sin b$ we can write

$$f(t, x) = f_1(t, x) \mp f_2(t, x) \quad \text{for} \tag{1.4.8}$$

$$f_1(t, x) = A\sin(kx + \delta)\cos\omega t, \tag{1.4.9}$$

$$f_2(t, x) = A\cos(kx + \delta)\sin\omega t, \tag{1.4.10}$$

where in $f_1, f_2$ the variables $t, x$ are separated. Moreover $f_1, f_2$ are also solutions of the wave equation (1.4.2) (since $C^2k^2 = \omega^2$).
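That a traveling wave of the form $A\sin(kx - \omega t + \delta)$ satisfies the wave equation with $C = \omega/k$ can also be checked numerically. The sketch below uses central second differences; the parameter values and the helper names (`second_diff`, `wave_residual`) are illustrative assumptions, not part of the text.

```python
import math

# Illustrative parameter values (hypothetical, chosen only for the check).
A, k, omega, delta = 6.7, 4 * math.pi, 7.0, 0.0
C = omega / k  # phase velocity, C = omega / k


def f(t, x):
    """Right-moving traveling wave A sin(kx - omega t + delta)."""
    return A * math.sin(k * x - omega * t + delta)


def second_diff(g, s, h=1e-4):
    """Central finite-difference approximation to g''(s)."""
    return (g(s + h) - 2.0 * g(s) + g(s - h)) / h**2


def wave_residual(t, x):
    """Approximate value of f_tt - C^2 f_xx; it should be close to zero."""
    f_tt = second_diff(lambda s: f(s, x), t)
    f_xx = second_diff(lambda s: f(t, s), x)
    return f_tt - C**2 * f_xx


if __name__ == "__main__":
    for t, x in [(0.0, 0.1), (0.5, 0.3), (2.0, 1.7)]:
        print(t, x, wave_residual(t, x))
```

The residual is not exactly zero because of truncation and rounding error in the finite differences, but it is several orders of magnitude smaller than the individual terms $f_{tt}$ and $C^2 f_{xx}$.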

Based on an analogy with Schrödinger’s equation we now sketch why it is reasonable that one should have the de Broglie relation $\lambda = \frac{h}{p}$. Our argument of course does not constitute a “proof” since in fact we consider the statement $\lambda = \frac{h}{p}$ as axiomatic. Let $\psi_1(x) = A\sin(kx + \delta)$, $\psi_2(x) = A\cos(kx + \delta)$ so that in (1.4.9), (1.4.10) $f_1(t, x) = \psi_1(x)\cos(\omega t)$, $f_2(t, x) = \psi_2(x)\sin(\omega t)$. If $\psi(x)$ = either $\psi_1(x)$ or $\psi_2(x)$, then $\psi$ satisfies

$$\psi''(x) + k^2\psi(x) = 0. \tag{1.4.11}$$

On the other hand, in the time-independent Schrödinger equation (equation (1.3.2)) regard $E - V(x)$ as the classical kinetic energy $T = \frac{p^2}{2m}$ where $p = mv$ is the momentum of our particle of mass $m$ with velocity $v$. That is, regard the total energy $E$ as $V + T$, as in the discussion following (1.3.12). Then, if we (incorrectly) write equation (1.3.2) as

$$\psi''(x) + \frac{8\pi^2 m}{h^2}\,\frac{p^2}{2m}\,\psi(x) = 0, \tag{1.4.12}$$

using (0.0.2), and compare (1.4.12) with equation (1.4.11), we see that the square $k^2$ of the wave number of the traveling wave $f$ in (1.4.8) corresponds to $\frac{8\pi^2 m}{h^2}\frac{p^2}{2m} = \frac{4\pi^2p^2}{h^2}$, or that $k$ corresponds to $\frac{2\pi p}{h}$. But $k = \frac{2\pi}{\lambda}$ where $\lambda$ is the wavelength of the traveling wave. Thus $\frac{2\pi}{\lambda}$ corresponds to $\frac{2\pi p}{h}$, or $\lambda$ corresponds to $\frac{h}{p} = \frac{h}{mv}$, which suggests exactly the de Broglie relation $\lambda = \frac{h}{p} = \frac{h}{mv}$.

As $P$ travels the circumference of $\Gamma$ in Figure 1.4 its projection onto the diameter of $\Gamma$ (on the $y$-axis) travels back and forth along this diameter. This back and forth motion is called simple harmonic motion (or simple harmonic oscillation), which we shall look at further in Chapter 2.

Examples

1. Let $f$ be given by

$$f(t, x) = 6.7\sin(4\pi x + 7t) \tag{1.4.13}$$

where $x$ is measured in centimeters and $t$ is measured in seconds. Then $f$ represents a wave traveling along the $x$-axis in the negative direction (because of the plus sign $+$) with harmonic amplitude 6.7 centimeters, wave number $k = 4\pi$, angular frequency $\omega = 7$ radians/sec, and phase constant $\delta = 0$. The frequency of the wave is $\nu = \frac{\omega}{2\pi} = \frac{7}{2\pi}$ Hz. The speed of the wave is $C = \frac{\omega}{k} = \frac{7}{4\pi}$ cm/sec, and the wavelength is $\lambda = \frac{2\pi}{k} = \frac{1}{2}$ centimeter.

2. The speed of a wave of frequency $\nu = 18.3$ Hz and wavelength $\lambda = 30.9$ cm is given by $C = \nu\lambda = \left(18.3\ \frac{1}{\text{sec}}\right)(30.9\ \text{cm}) = 565.5\ \frac{\text{cm}}{\text{sec}}$.

3. A golf ball of mass 50 grams moves at a speed of 60 feet/sec $= \frac{60}{3.28}$ meter/sec = 1829.3 cm/sec. Its de Broglie wavelength is given by $\lambda = \frac{h}{mv} = \frac{6.626 \times 10^{-27}}{50 \times 1829.3}$ cm $= 7.24 \times 10^{-32}$ cm, which is too small to be detected.

4. An electron moves with a kinetic energy of one electron volt, i.e., $1.60 \times 10^{-12}$ ergs. What is its de Broglie wavelength?

Solution: The electron mass is $m_e = 9.11 \times 10^{-28}$ grams; see (0.0.5). The kinetic energy $T = \frac{1}{2}m_ev^2 \implies v = \sqrt{\frac{2T}{m_e}} \implies m_ev = \sqrt{2Tm_e} = \sqrt{2 \times 1.6 \times 10^{-12} \times 9.11 \times 10^{-28}}\ \frac{\text{g cm}}{\text{sec}} = \sqrt{29.152 \times 10^{-40}}\ \frac{\text{g cm}}{\text{sec}} = 5.4 \times 10^{-20}\ \frac{\text{g cm}}{\text{sec}}$. Therefore $\lambda = \frac{h}{m_ev} = \frac{6.626 \times 10^{-27}}{5.4 \times 10^{-20}}$ cm $= 1.23 \times 10^{-7}$ cm. We note that by Table 1.1 this wavelength is in the x-ray range, which is considerably less than the range of visible light, $4 \times 10^{-5}$ to $7 \times 10^{-5}$ cm (violet to red).
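The arithmetic in Examples 3 and 4 can be reproduced with a short Python sketch. It works in cgs units and assumes the standard value $h \approx 6.626 \times 10^{-27}$ erg·sec for Planck's constant (the text's own value for $h$ is not restated in this section); the function names are illustrative.

```python
import math

H_PLANCK = 6.626e-27      # erg*sec (cgs); standard value, assumed here
ELECTRON_MASS = 9.11e-28  # grams, as quoted in Example 4
ERG_PER_EV = 1.60e-12     # ergs per electron volt, as quoted in Example 4


def de_broglie_from_speed(mass_g, speed_cm_s):
    """lambda = h / (m v), in centimeters."""
    return H_PLANCK / (mass_g * speed_cm_s)


def de_broglie_from_kinetic_energy(mass_g, energy_erg):
    """lambda = h / sqrt(2 T m): non-relativistic momentum from kinetic energy T."""
    momentum = math.sqrt(2.0 * energy_erg * mass_g)
    return H_PLANCK / momentum


if __name__ == "__main__":
    golf_ball = de_broglie_from_speed(50.0, 1829.3)
    electron = de_broglie_from_kinetic_energy(ELECTRON_MASS, 1.0 * ERG_PER_EV)
    print(f"golf ball: {golf_ball:.2e} cm")
    print(f"electron:  {electron:.2e} cm")
```

The roughly twenty-five orders of magnitude between the two results is the point of the comparison: the wave character matters for the electron and is undetectable for the golf ball.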

1.5 Heisenberg’s Uncertainty Principle

Among the foundational principles of quantum theory one of the most celebrated is the Heisenberg uncertainty principle, discovered by Werner Heisenberg in 1927. The principle states that it is impossible to measure simultaneously and precisely the position and momentum of a particle. One can measure very precisely the position $x$, or the velocity $v$ (and thus the momentum $p$ = mass × $v$) separately, but there is a natural limit to knowing both the values of position and velocity simultaneously and as accurately as one would desire, especially at the microscopic level. The very act of measuring accurately the position of an electron, for example, interferes with the measuring of its momentum. The limitation in simultaneity and precision of measurement of $x$ and $p$ can be ascribed to the wave nature of the electron, as we shall see.

If $\Delta x$, $\Delta p$ denote the uncertainty in the position $x$ and the momentum $p$, respectively, of a particle, then Heisenberg’s principle is usually expressed analytically as

$$(\Delta x)(\Delta p) \ge \frac{\hbar}{2}. \tag{1.5.1}$$

Thus as the uncertainty $\Delta x$ in $x$ decreases (i.e., as the position of the particle becomes more accurately known), the uncertainty $\Delta p$ in $p$ increases. Similarly as $\Delta p$ decreases, $\Delta x$ increases. So far $\Delta x$, $\Delta p$ have not been carefully defined. This will be done in Chapter 6, where a proof of (1.5.1) will be taken up. In fact, there we will provide two proofs of the fundamental inequality (1.5.1). For now, let us consider how one might arrive at (1.5.1) by proceeding heuristically.

As we considered a beam of light entering a narrow slit in Figure 1.1 (also cf. Figure 1.2), consider now a beam of electrons entering a narrow slit as indicated in Figure 1.6.

Figure 1.6: Single slit diffraction of electrons

After passing through the slit the field of electrons is diffracted (i.e., is “spread out”) and eventually impacts the fluorescent screen throughout a range of $x$-values. It is because of the wave nature (not the particle nature) of electrons that this diffraction phenomenon is exhibited. Most of the electrons will land on the screen within the symmetric interval $[-x_1, x_1]$ about the origin 0, where $\pm x_1$ are initial minimal points of an intensity distribution of illumination curve for electrons impacting the screen, as indicated in Figure 1.6.

Suppose we follow the path of a single electron $e$ entering the slit. As it enters, the uncertainty $\Delta x$ in its position $x$ will be chosen to correspond to the width $w$ of the slit ($\Delta x \approx w$), which we could make arbitrarily small. As $e$ passes through the slit and is diffracted in an arbitrary and unpredictable direction, an uncertainty in its velocity and momentum is introduced, and it is impossible to determine in advance at what point $e$ will strike the screen. It has the best chance however of striking within the interval $[-x_1, x_1]$. Now for light of wavelength $\lambda$ passing through the slit one has the following simple diffraction theory result:

$$\sin\theta = \frac{\lambda}{w} \tag{1.5.2}$$

for $\theta$ in Figure 1.6. We consider $\theta$ very small so that $\sin\theta \approx \tan\theta$, which is the ratio of $x_1$ to the distance from the slit to the screen; thus $x_1$ is approximately $\frac{\lambda}{w}$ times that distance. Also now choose $\lambda$ to correspond to the de Broglie wavelength $\lambda = \frac{h}{p}$ of $e$. After leaving the slit the momentum of $e$ acquires an $x$-component $p_x$ (because of diffraction) whose magnitude we take as a measure of the uncertainty of $e$’s momentum: $\Delta p \approx p_x$.

As Figure 1.7 indicates, $\Delta p \approx p_x = p\sin\theta = \frac{p\lambda}{w}$ (by (1.5.2)) $= \frac{h}{w}$. Recalling that $w \approx \Delta x$ we see that $(\Delta x)(\Delta p) \approx h = 2\pi\hbar > \frac{\hbar}{2}$, which somewhat crudely establishes equation (1.5.1) for now, until a clean, rigorous argument in Chapter 6 is presented.

The uncertainty principle was stated here for the variables position and momentum $x, p$, but it extends to other “conjugate” variables as well. For time and energy $t, E$, for example, one has

$$(\Delta t)(\Delta E) \ge \frac{\hbar}{2} \tag{1.5.3}$$

for suitable definitions of $\Delta t$, $\Delta E$. A physical meaning of (1.5.3), for example, is that the energy of a molecule in an excited state for some time period cannot be exactly determined. Heisenberg’s principles spell the downfall of classical determinism: the future is not necessarily determined by initial data of the past, a matter which we return to later.

1.6 Bohr’s Complementarity Principle 

The wave-like character of matter has been noted in this chapter, in discussions ranging from de Broglie’s wavelength to electron diffraction, for example, as illustrated in Figure 1.6. The dual character of an electron, for example, as wave-like or particle-like, as revealed in numerous experiments, or the dual character of other physical entities, eventually led to the enunciation of a complementarity principle by Niels Bohr in 1928, a year after Heisenberg uncertainty was discovered. Bohr’s principle, like Heisenberg’s principle, forbids a certain simultaneity. Namely, it states the impossibility of one observing simultaneously both the wave and particle-like nature of matter, and that both aspects must be taken into account in any attempt to completely describe a physical entity.

Figure 1.7: Computing $\Delta p$

Thus the wave and particle nature of matter complement each other. Whether 
one or the other of these natures is observed in some experiment depends on 
the type of experiment being performed. Any experiment designed to manifest a 
wave-like attribute can provide no manifestation of particle-like attributes — the 
converse being true as well. 

The profound principles of uncertainty and complementarity of Heisenberg and Bohr, with their weighty philosophic implications, are empirically rooted. They express fundamental limitations on human measurements and observations, limitations which reflect deep intrinsic realities of nature, and which cannot be surmounted by the construction of “better mouse traps.”

Equations of Motion in Classical Mechanics

2.1 Introduction

In classical mechanics one studies the motions of objects on a macroscopic level. 
These motions, of a particle or a system of particles, of the planets around the 
sun, or of a pendulum, for example, are governed by differential equations. These 
equations, or their solutions, are called equations of motion, which in earliest 
form go back to Newton. Other forms are due to Lagrange, Hamilton, Jacobi, and 
others. 

Newton’s basic law, his second law of motion, states that a force $F$ acting on an object of mass $m$ produces an acceleration $a$ of that object (in the direction of $F$) of the magnitude $|F|/m$ where $|F|$ is the magnitude of $F$. Taking the notion of a force as fundamental or axiomatic one can use this law in fact to define mass: $m = |F|/|a|$. For an object moving along the $x$-axis, say, under the influence of $F$ the acceleration is given in terms of the second derivative of its position $x(t)$ with respect to time $t$. Thus the equation of motion takes the familiar form

$$m\frac{d^2x}{dt^2} = |F|. \tag{2.1.1}$$

We shall review some basic definitions and concepts of classical mechanics. Using equation (2.1.1), the chain rule, and the fundamental theorem of calculus, we deduce, for example, the conservation of energy (along a trajectory) and explain why the $\frac{1}{2}$ appears in the definition $T = \frac{1}{2}mv^2$ of the kinetic energy $T$ of a particle with velocity $v$. Hamiltonian and Lagrangian forms of equations of motion will also be derived.

2.2 Motion in One Dimension

Consider a particle of mass $m > 0$ moving in one dimension with equation of motion $x = \phi(t)$ (where $t$ = time), where $\phi'$, $\phi''$ exist and are continuous on $\mathbb{R}$. The functions $v = \phi'$, $a = \phi''$ evaluated at $t$ give the velocity, acceleration, respectively, of the particle at the instant $t$. Suppose the particle is under the influence of a continuous force field $F$: $F$ is a continuous function on $\mathbb{R}$ such that $F(x)$ = force at position $x$.

By Newton’s second law, equation (2.1.1),

$$F(\phi(t)) = ma(t) = m\phi''(t). \tag{2.2.1}$$

Consider the function $H : \mathbb{R} \to \mathbb{R}$ given by

$$H(t) = \frac{1}{2}mv(t)^2 - \int_{a=\text{any constant}}^{\phi(t)} F(x)\,dx. \tag{2.2.2}$$

If

$$\psi(t) = \int_a^t F(x)\,dx \tag{2.2.3}$$

then (as $F$ is continuous) $\psi'(t) = F(t)$ by the fundamental theorem of calculus, and

$$\int_a^{\phi(t)} F(x)\,dx = \psi(\phi(t)). \tag{2.2.4}$$

That is, by the chain rule (as $\psi$, $\phi$ are differentiable on $\mathbb{R}$),

$$H'(t) = \frac{1}{2}m \cdot 2v(t)v'(t) - \psi'(\phi(t))\phi'(t) = mv(t)v'(t) - F(\phi(t))v(t) = mv(t)v'(t) - mv'(t)v(t) = 0. \tag{2.2.5}$$

That is, we have the following conservation law:

$$\frac{1}{2}mv(t)^2 - \int_{a=\text{any constant}}^{\phi(t)} F(x)\,dx = \text{a constant} \tag{2.2.6}$$

for all $t$. The first term $\frac{1}{2}mv(t)^2$ is the kinetic energy of the particle at time $t$. The kinetic energy $T$ of the particle, regarded as a function $\mathbb{R} \to \mathbb{R}$ of the velocity, is given by $T(v) = \frac{1}{2}mv^2$.

Consider the second term. If we define $V : \mathbb{R} \to \mathbb{R}$ by

$$V(x) = -\int_a^x F(r)\,dr \tag{2.2.7}$$

we obtain a continuously differentiable function $V$ such that $V'(x) = -F(x)$; i.e., $F(x) = -V'(x) \implies F$ is derived from a potential $V$ (or potential energy function $V$). The second term in equation (2.2.6) is $V(\phi(t))$. If $V_1$ is any other continuously differentiable function on $\mathbb{R}$ such that $F(x) = -V_1'(x)$, then of course $V_1 = V$ + constant. We can therefore state equation (2.2.6) as follows:

Let $V$ be any choice of potential energy function for $F$: $V$ is a continuously differentiable function on $\mathbb{R}$ such that $F(x) = -V'(x)$; example:

$$V(x) = -\int_a^x F(r)\,dr. \tag{2.2.8}$$

Then along the trajectory $t \to \phi(t)$ the total energy $E = T + V$ of the particle is conserved:

$$T(v(t)) + V(\phi(t)) = \frac{1}{2}mv(t)^2 + V(\phi(t)) = \text{a constant for all time } t;$$

$$\frac{1}{2}mv(t)^2 - \int_a^{\phi(t)} F(x)\,dx = \text{a constant for all time } t. \tag{2.2.9}$$

We see the reason for the factor 1/2 in the definition of $T$: it is needed in order to have $H'(t) = 0$, and thus the conservation of energy.

Consider any two instants of time $t_1, t_2$, say $t_1 < t_2$. Write

$$\int_{\phi(t_1)}^{\phi(t_2)} F(x)\,dx = \int_{\phi(t_1)}^{0} F(x)\,dx + \int_0^{\phi(t_2)} F(x)\,dx = \int_0^{\phi(t_2)} F(x)\,dx - \int_0^{\phi(t_1)} F(x)\,dx. \tag{2.2.10}$$

By the conservation of total energy (with $a = 0$) and equation (2.2.10)

$$\frac{1}{2}mv(t_1)^2 - \int_0^{\phi(t_1)} F(x)\,dx = \frac{1}{2}mv(t_2)^2 - \int_0^{\phi(t_2)} F(x)\,dx \implies \frac{1}{2}mv(t_2)^2 - \frac{1}{2}mv(t_1)^2 = \int_{\phi(t_1)}^{\phi(t_2)} F(x)\,dx, \tag{2.2.11}$$

which relates the change in kinetic energy to the work

$$\int_{\phi(t_1)}^{\phi(t_2)} F(x)\,dx \tag{2.2.12}$$

done in displacement of the particle from $\phi(t_1)$ to $\phi(t_2)$ in the field $F$. From equation (2.2.1)

$$\int_{t_1}^{t_2} F(\phi(t))\,dt = mv(t_2) - mv(t_1) \tag{2.2.13}$$

where $mv(t)$ is the momentum of the particle at time $t$. One can regard the momentum as a function $p : \mathbb{R} \to \mathbb{R}$ of the velocity: $p(v) = mv$. Equation (2.2.13) expresses the fact that the change in momentum coincides with the time-integral of the force field.



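We are given $\phi'$, $\phi''$, $F$ continuous on $\mathbb{R}$ with $F(\phi(t)) = m\phi''(t)$. Again choose $V$ continuously differentiable on $\mathbb{R}$ such that $V'(x) = -F(x)$. Define $T, V, H : \mathbb{R}^2 \to \mathbb{R}$ by

$$T(q, p) = \frac{p^2}{2m}, \qquad V(q, p) = V(q), \qquad H = T + V : H(q, p) = \frac{p^2}{2m} + V(q) \tag{2.2.14}$$

$$\implies \frac{\partial H}{\partial q}(q, p) = V'(q), \qquad \frac{\partial H}{\partial p}(q, p) = \frac{p}{m}. \tag{2.2.15}$$

$T$ is $C^\infty$ and $V$ is class $C^1$ (as $V$ is class $C^1$) $\implies H$ is class $C^1$. As $\phi$, $\phi'$ are class $C^1$ the function $t \to H(\phi(t), m\phi'(t))$ is continuously differentiable and

$$\frac{d}{dt}H(\phi(t), m\phi'(t)) = \frac{\partial H}{\partial q}(\phi(t), m\phi'(t))\,\phi'(t) + \frac{\partial H}{\partial p}(\phi(t), m\phi'(t))\,m\phi''(t) = V'(\phi(t))\phi'(t) + \frac{m\phi'(t)}{m}\,m\phi''(t) = -F(\phi(t))\phi'(t) + \phi'(t)F(\phi(t)) = 0, \tag{2.2.16}$$

which proves

Theorem 2.1. $t \to H(\phi(t), m\phi'(t))$ is a constant. Also

$$\frac{\partial H}{\partial q}(\phi(t), m\phi'(t)) = -\frac{d}{dt}(m\phi'(t)), \qquad \frac{\partial H}{\partial p}(\phi(t), m\phi'(t)) = \frac{d\phi}{dt}(t).$$

The function $H$ is called the Hamiltonian, and the equations of the system in Theorem 2.1 are Hamilton’s equations. For examples involving these equations see Section 2.6 below.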

Example: Simple Harmonic Motion

Suppose a point $P$ moves counterclockwise around a circle of radius $C$ with a constant angular speed of $\omega$ radians per second. Consider the up and down motion along the $x$-axis of the projection $x$ of $P$ onto that axis.

To find the equation of motion note that

$$\frac{x}{C} = \sin\theta \implies x = C\sin\theta = C\cos\left(\theta - \frac{\pi}{2}\right) \quad \text{where } \theta = \theta(t).$$

$$\omega = \frac{d\theta}{dt} \implies \theta(t) = t\omega + \theta(0) \implies x = \phi(t) = C\cos(\omega t - \delta) \quad \text{for } \delta \overset{\text{def}}{=} \frac{\pi}{2} - \theta(0) \tag{2.2.17}$$

is the desired equation of motion.

Figure 2.1: Simple harmonic motion (cf. Figure 1.4)

The motion along the $x$-axis is an oscillation between $x$-max $= C$ and $x$-min $= -C$. The maximum distance $C$ from 0 is called the amplitude of the oscillation. The amount of time $T$ required for one complete oscillation, say for a particle at $x$ to move from $x$ to $x$-max, then down to $x$-min and back up to $x$ (which is the amount of time for $P$ to complete one revolution), is called the period of the oscillation: Given any time $t$, $\theta(t + T) = \theta(t) + 2\pi$; i.e., $(t + T)\omega + \theta(0) = t\omega + \theta(0) + 2\pi \implies T\omega = 2\pi \implies T = 2\pi/\omega$, exactly as in Section 1.4 of Chapter 1. As it takes $T$ seconds for one complete revolution of $P$, the number of revolutions of $P$ per second (possibly a non-integer) is $\nu = 1/T$, called the frequency of the oscillation; thus $\nu = \omega/(2\pi)$.

$x(t) = C\cos(\omega t - \delta) \implies v(t) = -\omega C\sin(\omega t - \delta) \implies a(t) = -\omega^2x(t)$. That is, for $V(x) \overset{\text{def}}{=} 2\pi^2\nu^2mx^2$, $m$ = mass, $V$ is the potential energy of our particle: $V'(x) = \omega^2xm \implies -V'(x(t)) = ma(t)$.
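As a quick numerical sanity check of the statements above, the following Python sketch evaluates the kinetic plus potential energy along $x(t) = C\cos(\omega t - \delta)$ with $V(x) = 2\pi^2\nu^2mx^2$ and confirms that it stays at $\frac{1}{2}m\omega^2C^2$. The numerical values of $m$, $C$, $\omega$, $\delta$ are arbitrary illustrative choices, not taken from the text.

```python
import math

# Hypothetical illustrative values: mass, amplitude, angular speed, phase.
m, C, omega, delta = 2.0, 1.5, 3.0, 0.4
nu = omega / (2.0 * math.pi)  # frequency nu = omega / (2 pi)


def x(t):
    """Position x(t) = C cos(omega t - delta)."""
    return C * math.cos(omega * t - delta)


def v(t):
    """Velocity v(t) = -omega C sin(omega t - delta)."""
    return -omega * C * math.sin(omega * t - delta)


def V(pos):
    """Potential energy V(x) = 2 pi^2 nu^2 m x^2 (equal to m omega^2 x^2 / 2)."""
    return 2.0 * math.pi**2 * nu**2 * m * pos**2


def total_energy(t):
    """Kinetic plus potential energy at time t."""
    return 0.5 * m * v(t) ** 2 + V(x(t))


if __name__ == "__main__":
    for t in (0.0, 0.3, 1.7, 10.0):
        print(t, total_energy(t))  # each value should equal m * omega**2 * C**2 / 2
```

Since $\sin^2 + \cos^2 = 1$, the sum collapses to $\frac{1}{2}m\omega^2C^2$ for every $t$, which is the conservation law of Theorem 2.1 in this special case.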

Example

It is well known that a freely falling object, relatively near to the surface of the earth, experiences a constant acceleration due to gravity. The magnitude of the acceleration, denoted by $g$, is approximately 32 ft/sec$^2$. The object also experiences a force due to air resistance, but we assume here such a force to be negligible. Suppose we wished to determine the depth of a certain well. One way to proceed is as follows. Drop a stone into the well and carefully determine the number of seconds $\tau$ it takes to hear the sound of the stone striking the bottom of the well. Suppose the speed of sound is $\sigma$ ft/sec; one knows that $\sigma$ is approximately 1100 ft/sec. What is the depth $d$ of the well in terms of $\tau$ and $\sigma$?

To answer this question note first that choosing the $y$-coordinate as illustrated in Figure 2.2 we have by the preceding remarks

$$\frac{d^2y}{dt^2} = g \tag{2.2.18}$$

so that the equation of motion of the stone is

$$y(t) = \frac{gt^2}{2} + at + b. \tag{2.2.19}$$

Figure 2.2: A well of depth $d$

The initial velocity is $y'(0) = a$ which we assume is 0. Also $b = y(0) = 0$: $y(t) = \frac{gt^2}{2} \implies d = \frac{gt^2}{2}$ provides the amount of time $t$ it takes for the stone to reach the bottom of the well. Since sound travels $\sigma$ ft/sec, once the stone hits the bottom an additional $d/\sigma$ seconds will be needed to hear the sound of it striking, as the sound travels upward a distance $d$ feet from the bottom of the well. Thus $\tau$ is given by $\tau = d/\sigma + t$ where $gt^2/2 = d$, which means that $d$ is subject to the quadratic equation

$$d^2 - 2\sigma\left(\tau + \frac{\sigma}{g}\right)d + \sigma^2\tau^2 = 0. \tag{2.2.20}$$

The solutions are

$$d = \sigma\tau + \frac{\sigma^2}{g} \pm \frac{\sigma^2}{g}\sqrt{\frac{2\tau g}{\sigma} + 1}. \tag{2.2.21}$$

Now the time $t$ is positive:

$$0 > -t = \frac{d}{\sigma} - \tau = \frac{\sigma}{g} \pm \frac{\sigma}{g}\sqrt{\frac{2\tau g}{\sigma} + 1} \tag{2.2.22}$$

which forces the choice of the minus sign in equation (2.2.21). The solution of the problem is therefore

$$d = \sigma\tau + \frac{\sigma^2}{g} - \frac{\sigma^2}{g}\sqrt{\frac{2\tau g}{\sigma} + 1}. \tag{2.2.23}$$

Note that equation (2.2.23) can also be written

$$d = \frac{\sigma^2}{2g}\left[\sqrt{\frac{2\tau g}{\sigma} + 1} - 1\right]^2. \tag{2.2.24}$$
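The well-depth solution is easy to turn into a small calculator. The sketch below is one way to do it in Python, taking the depth to be $d = \sigma\tau + \frac{\sigma^2}{g} - \frac{\sigma^2}{g}\sqrt{\frac{2\tau g}{\sigma} + 1}$ (the minus-sign root), with the approximate values $g = 32$ ft/sec$^2$ and $\sigma = 1100$ ft/sec from the example as defaults. The function names `well_depth` and `total_delay` are illustrative, and `total_delay` is included only to check the answer against $\tau = \sqrt{2d/g} + d/\sigma$.

```python
import math


def well_depth(tau, g=32.0, sigma=1100.0):
    """Depth d (feet) of the well from the total delay tau (seconds).

    Uses the minus-sign root of the quadratic for d.
    """
    ratio = sigma**2 / g
    return sigma * tau + ratio - ratio * math.sqrt(2.0 * tau * g / sigma + 1.0)


def total_delay(d, g=32.0, sigma=1100.0):
    """Fall time sqrt(2d/g) plus sound travel time d/sigma, in seconds."""
    return math.sqrt(2.0 * d / g) + d / sigma


if __name__ == "__main__":
    tau = 3.0  # hypothetical measured delay in seconds
    d = well_depth(tau)
    print(f"depth = {d:.1f} ft, round-trip check = {total_delay(d):.6f} sec")
```

Ignoring the sound travel time would give $g\tau^2/2 = 144$ ft for $\tau = 3$ sec; the corrected depth is somewhat smaller, since part of the delay is spent waiting for the sound to come back up.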



2.3 Motion in Three Dimensions 



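We can extend the discussion of a particle of mass $m > 0$ to its motion in three dimensions (or in two dimensions). In this case we have the motion specified by three functions $\phi_j : \mathbb{R} \to \mathbb{R}$ with continuous derivatives $\phi_j'$, $\phi_j''$, $j = 1, 2, 3$. That is, we have a function $\phi : \mathbb{R} \to \mathbb{R}^3$, $\phi = (\phi_1, \phi_2, \phi_3)$, such that at time $t$, $\phi(t)$ is the position $(x, y, z)$ of the particle. The velocity $v$ and acceleration $a$ of the particle as functions of time are given as maps $v, a : \mathbb{R} \to \mathbb{R}^3$: $v(t) \overset{\text{def}}{=} (\phi_1'(t), \phi_2'(t), \phi_3'(t))$, $a(t) \overset{\text{def}}{=} (\phi_1''(t), \phi_2''(t), \phi_3''(t))$. One also writes $v = (v_1, v_2, v_3)$, $a = (a_1, a_2, a_3)$. Assume that we have a continuous force field $F : \mathbb{R}^3 \to \mathbb{R}^3$: for the particle at position $(x, y, z)$ the force acting on it is $F(x, y, z) = (F_1(x, y, z), F_2(x, y, z), F_3(x, y, z))$ where $F_j : \mathbb{R}^3 \to \mathbb{R}$ are the (continuous) components of $F$. Newton’s second law $ma = F$ now takes the form

$$m\phi_j''(t) = F_j(\phi_1(t), \phi_2(t), \phi_3(t)) \quad \text{at time } t, \text{ for } j = 1, 2, 3. \tag{2.3.1}$$

We shall also assume $F$ is derived from a potential $V$: $V : \mathbb{R}^3 \to \mathbb{R}$ is a map with continuous first partial derivatives such that $F = -\nabla V$; i.e.,

$$F_1 = -\frac{\partial V}{\partial x}, \qquad F_2 = -\frac{\partial V}{\partial y}, \qquad F_3 = -\frac{\partial V}{\partial z}. \tag{2.3.2}$$

For motion in one dimension, the latter assumption is automatically fulfilled as we have seen. Now $\phi_1, \phi_2, \phi_3 : \mathbb{R} \to \mathbb{R}$ and $V : \mathbb{R}^3 \to \mathbb{R}$ are continuously differentiable. Thus if we define $h : \mathbb{R} \to \mathbb{R}$ by $h(t) = V(\phi_1(t), \phi_2(t), \phi_3(t))$ we obtain (by the chain rule) that $h$ is differentiable, and

$$\begin{aligned}
h'(t) &= \frac{\partial V}{\partial x}(\phi(t))\,\phi_1'(t) + \frac{\partial V}{\partial y}(\phi(t))\,\phi_2'(t) + \frac{\partial V}{\partial z}(\phi(t))\,\phi_3'(t)\\
&= -F_1(\phi(t))\,\phi_1'(t) - F_2(\phi(t))\,\phi_2'(t) - F_3(\phi(t))\,\phi_3'(t)\\
&= -m\phi_1''(t)\phi_1'(t) - m\phi_2''(t)\phi_2'(t) - m\phi_3''(t)\phi_3'(t) \quad \text{by (2.3.1).}
\end{aligned} \tag{2.3.3}$$

Therefore define $H : \mathbb{R} \to \mathbb{R}$ by

$$H(t) = \frac{1}{2}m\langle v(t), v(t)\rangle + h(t) = \frac{1}{2}m\left(\phi_1'(t)^2 + \phi_2'(t)^2 + \phi_3'(t)^2\right) + h(t) \tag{2.3.4}$$

to obtain

$$\begin{aligned}
H'(t) &= \frac{1}{2}m\left(2\phi_1'(t)\phi_1''(t) + 2\phi_2'(t)\phi_2''(t) + 2\phi_3'(t)\phi_3''(t)\right)\\
&\quad - m\phi_1''(t)\phi_1'(t) - m\phi_2''(t)\phi_2'(t) - m\phi_3''(t)\phi_3'(t) = 0\\
&\implies H(t) \text{ is a constant.}
\end{aligned} \tag{2.3.5}$$

That is, we have the conservation law $\frac{1}{2}m\langle v(t), v(t)\rangle + V(\phi(t))$ = a constant for all time $t$, for $F = -\nabla V$. The first term here is the kinetic energy of the particle at time $t$. The kinetic energy $T$ as a function $\mathbb{R}^3 \to \mathbb{R}$ of $v$ is given by

$$T(v) = \frac{1}{2}m\langle v, v\rangle = \frac{1}{2}m\left(v_1^2 + v_2^2 + v_3^2\right). \tag{2.3.6}$$

$T$ can also be expressed in terms of the momentum $p = mv$ of the particle: $p_j = mv_j \implies$

$$\frac{p_j^2}{2m} = \frac{mv_j^2}{2} \implies T(v) = \frac{m}{2}\left(v_1^2 + v_2^2 + v_3^2\right) = \frac{1}{2m}\left(p_1^2 + p_2^2 + p_3^2\right) = \frac{1}{2m}\langle p, p\rangle. \tag{2.3.7}$$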


2.4 Many Particles in Three Dimensions 

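Consider now a system of $n$ particles moving in three dimensions, with $m_j$ = mass of the $j$th particle. Let $\phi^{(j)} = (\phi_1^{(j)}, \phi_2^{(j)}, \phi_3^{(j)}) : \mathbb{R} \to \mathbb{R}^3$ specify the position of the $j$th particle. It is assumed as before that continuous derivatives $(\phi_i^{(j)})'$, $(\phi_i^{(j)})''$, $i = 1, 2, 3$, exist. We simplify the notation by also writing $\phi^{(j)} = (x_j, y_j, z_j)$:

$$\begin{aligned}
x_j &= \phi_1^{(j)}, & \dot{x}_j &= (\phi_1^{(j)})', & \ddot{x}_j &= (\phi_1^{(j)})'',\\
y_j &= \phi_2^{(j)}, & \dot{y}_j &= (\phi_2^{(j)})', & \ddot{y}_j &= (\phi_2^{(j)})'',\\
z_j &= \phi_3^{(j)}, & \dot{z}_j &= (\phi_3^{(j)})', & \ddot{z}_j &= (\phi_3^{(j)})''.
\end{aligned} \tag{2.4.1}$$

For each $j$, $1 \le j \le n$, consider the $j$th particle under the influence of a continuous force field $F^{(j)}$ where now $F^{(j)} = (F_1^{(j)}, F_2^{(j)}, F_3^{(j)}) : \mathbb{R}^{3n} \to \mathbb{R}^3$ (instead of $\mathbb{R}^3 \to \mathbb{R}^3$); also write $F^{(j)} = (X_j, Y_j, Z_j)$; thus $X_j, Y_j, Z_j : \mathbb{R}^{3n} \to \mathbb{R}$ are continuous functions. If $v^{(j)}, a^{(j)} : \mathbb{R} \to \mathbb{R}^3$ denote the velocity, acceleration functions of the $j$th particle (thus $v^{(j)} = (\dot{x}_j, \dot{y}_j, \dot{z}_j)$, $a^{(j)} = (\ddot{x}_j, \ddot{y}_j, \ddot{z}_j)$ by the above notation) then Newton’s second law takes the form

$$m_j\ddot{x}_j(t) = F_1^{(j)}\left(\phi^{(1)}(t), \ldots, \phi^{(n)}(t)\right) = X_j(x_1(t), y_1(t), z_1(t), \ldots, x_n(t), y_n(t), z_n(t)), \tag{2.4.2}$$

and similarly

$$\begin{aligned}
m_j\ddot{y}_j(t) &= Y_j(x_1(t), y_1(t), z_1(t), \ldots, x_n(t), y_n(t), z_n(t)),\\
m_j\ddot{z}_j(t) &= Z_j(x_1(t), y_1(t), z_1(t), \ldots, x_n(t), y_n(t), z_n(t)),
\end{aligned} \tag{2.4.3}$$

for $1 \le j \le n$.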

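As before we shall be interested in the case when a potential energy function $V$ exists for our system: $V : \mathbb{R}^{3n} \to \mathbb{R}$ has continuous first partial derivatives such that

$$X_j = -\frac{\partial V}{\partial x_j}, \qquad Y_j = -\frac{\partial V}{\partial y_j}, \qquad Z_j = -\frac{\partial V}{\partial z_j} \tag{2.4.4}$$

at $(x_1, y_1, z_1, x_2, y_2, z_2, \ldots, x_n, y_n, z_n) \in \mathbb{R}^{3n}$, $1 \le j \le n$. We have $\phi_1^{(1)}, \phi_2^{(1)}, \phi_3^{(1)}, \ldots, \phi_1^{(n)}, \phi_2^{(n)}, \phi_3^{(n)} : \mathbb{R} \to \mathbb{R}$, $V : \mathbb{R}^{3n} \to \mathbb{R}$ are continuously differentiable. If we therefore define $h : \mathbb{R} \to \mathbb{R}$ by

$$h(t) = V\left(\phi_1^{(1)}(t), \phi_2^{(1)}(t), \phi_3^{(1)}(t), \ldots, \phi_1^{(n)}(t), \phi_2^{(n)}(t), \phi_3^{(n)}(t)\right) = V(\phi(t)), \tag{2.4.5}$$

we obtain by the chain rule, writing $\phi_p$ for the $p$th of the $3n$ components of $\phi$,

$$\begin{aligned}
h'(t) &= \sum_{p=1}^{3n}(D_pV)(\phi(t))\,\phi_p'(t)\\
&= \sum_{j=1}^{n}\left[\frac{\partial V}{\partial x_j}(\phi(t))\,\dot{x}_j(t) + \frac{\partial V}{\partial y_j}(\phi(t))\,\dot{y}_j(t) + \frac{\partial V}{\partial z_j}(\phi(t))\,\dot{z}_j(t)\right]\\
&= -\sum_{j=1}^{n}\left[X_j(\phi(t))\,\dot{x}_j(t) + Y_j(\phi(t))\,\dot{y}_j(t) + Z_j(\phi(t))\,\dot{z}_j(t)\right]\\
&= -\sum_{j=1}^{n} m_j\left[\ddot{x}_j(t)\dot{x}_j(t) + \ddot{y}_j(t)\dot{y}_j(t) + \ddot{z}_j(t)\dot{z}_j(t)\right].
\end{aligned} \tag{2.4.6}$$

Hence, for $H : \mathbb{R} \to \mathbb{R}$ defined by

$$H(t) = \frac{1}{2}\sum_{j=1}^{n} m_j\left\langle v^{(j)}(t), v^{(j)}(t)\right\rangle + h(t) = \frac{1}{2}\sum_{j=1}^{n} m_j\left(\dot{x}_j(t)^2 + \dot{y}_j(t)^2 + \dot{z}_j(t)^2\right) + h(t), \tag{2.4.7}$$

$H'(t) = 0 \implies$ the conservation law:

$$\frac{1}{2}\sum_{j=1}^{n} m_j\left\langle v^{(j)}(t), v^{(j)}(t)\right\rangle + V\left(\phi_1^{(1)}(t), \phi_2^{(1)}(t), \ldots, \phi_3^{(n)}(t)\right) \tag{2.4.8}$$

is a constant for all $t$. This generalizes (2.3.5).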

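The kinetic energy $T : \mathbb{R}^{3n} \to \mathbb{R}$ of the system as a function of its velocities is
given by

\[
\begin{aligned}
T(v^{(1)}, \ldots, v^{(n)}) &= T(v_1^{(1)}, v_2^{(1)}, v_3^{(1)}, \ldots, v_1^{(n)}, v_2^{(n)}, v_3^{(n)}) && (2.4.9) \\
&\overset{\text{def}}{=} \frac{1}{2}\sum_{j=1}^{n} m_j \langle v^{(j)}, v^{(j)}\rangle && (2.4.10) \\
&= \frac{1}{2}\sum_{j=1}^{n} m_j\left((v_1^{(j)})^2 + (v_2^{(j)})^2 + (v_3^{(j)})^2\right), && (2.4.11)
\end{aligned}
\]

(where we write $v^{(j)} = (v_1^{(j)}, v_2^{(j)}, v_3^{(j)})$) or

\[
T = \sum_{j=1}^{n} \frac{1}{2m_j}\langle p^{(j)}, p^{(j)}\rangle \tag{2.4.12}
\]

for $p^{(j)} = m_j v^{(j)}$ the momentum of the $j$th particle. Definition (2.4.9) is motivated
by the first term in (2.4.8).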

We have shown in general that the definition of the kinetic energy function
is motivated directly by a simple conservation law (which requires the factor $\tfrac{1}{2}$)
based on elementary calculus: Energy is conserved along a trajectory.



2.5 Coulomb and Gravitational Potentials 

Consider a system of two particles: $n = 2$. In the present example we will define
force fields $F^{(j)}$, $j = 1, 2$, on an open set $U \subset \mathbb{R}^6$ (rather than on $\mathbb{R}^6$).

Let $U = \{(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6 \mid (x_1, y_1, z_1) \ne (x_2, y_2, z_2)\}$. If we write
$(x_1, y_1, z_1, x_2, y_2, z_2) = (p_1, p_2) \in \mathbb{R}^3 \times \mathbb{R}^3$ and $\pi_1, \pi_2 : \mathbb{R}^6 \to \mathbb{R}^3$ for the
projections $(p_1, p_2) \to p_1, p_2$, then as $\pi_1, \pi_2$ are continuous (and as $\mathbb{R}^3$ is Hausdorff),
$U^c = \{x \in \mathbb{R}^6 \mid \pi_1(x) = \pi_2(x)\}$ is closed in $\mathbb{R}^6$. That is, $U = \mathbb{R}^6 - U^c$ is open.
For real constants $q_1, q_2$ fixed define $F^{(1)} : U \to \mathbb{R}^3$ (continuous) by

\[
F^{(1)}(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1 q_2\,\{(x_1, y_1, z_1) - (x_2, y_2, z_2)\}}{|(x_1, y_1, z_1) - (x_2, y_2, z_2)|^3} \tag{2.5.1}
\]

and define $F^{(2)} : U \to \mathbb{R}^3$ (continuous) by $F^{(2)} = -F^{(1)}$.

The physical meaning here is the following. Think of $q_j$ as the "charge" of
the $j$th particle. For particle one at position $(x_1, y_1, z_1)$ and particle two at
position $(x_2, y_2, z_2)$, Coulomb's inverse square law (compare equation (0.0.4) in
Chapter 0) of electrostatics says that the force on particle one due to particle two
is $F^{(1)}(x_1, y_1, z_1, x_2, y_2, z_2)$, and the force on particle two due to particle one is
$F^{(2)}(x_1, y_1, z_1, x_2, y_2, z_2) = -F^{(1)}(x_1, y_1, z_1, x_2, y_2, z_2)$. Note that

\[
\begin{aligned}
|F^{(1)}(x_1, y_1, z_1, x_2, y_2, z_2)| &= |F^{(2)}(x_1, y_1, z_1, x_2, y_2, z_2)| \\
&= \frac{|q_1 q_2|\,[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{1/2}}{|(x_1, y_1, z_1) - (x_2, y_2, z_2)|^3 = [(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{3/2}} \\
&= \frac{|q_1 q_2|}{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2} \\
&= \frac{|q_1 q_2|}{[\text{distance from } (x_1, y_1, z_1) \text{ to } (x_2, y_2, z_2)]^2},
\end{aligned} \tag{2.5.2}
\]

which explains the phrase "inverse square." In (2.5.1) we choose the constant $k$
in (0.0.4) equal to 1.

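Writing $F^{(j)} = (X_j, Y_j, Z_j)$, where $X_j, Y_j, Z_j : U \to \mathbb{R}$ are continuous
functions, we get $X_2 = -X_1$, $Y_2 = -Y_1$, $Z_2 = -Z_1$, with

\[
\begin{aligned}
X_1(x_1, y_1, z_1, x_2, y_2, z_2) &= \frac{q_1 q_2 (x_1 - x_2)}{[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{3/2}}, \\
Y_1(x_1, y_1, z_1, x_2, y_2, z_2) &= \frac{q_1 q_2 (y_1 - y_2)}{[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{3/2}}, \\
Z_1(x_1, y_1, z_1, x_2, y_2, z_2) &= \frac{q_1 q_2 (z_1 - z_2)}{[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{3/2}}
\end{aligned} \tag{2.5.3}
\]

on $U$. We can construct a potential energy function $V : U \to \mathbb{R}$ for our system by
defining

\[
V(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1 q_2}{\sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}}. \tag{2.5.4}
\]

Indeed,

\[
\begin{aligned}
\frac{\partial V}{\partial x_1}(x_1, y_1, z_1, x_2, y_2, z_2)
&= q_1 q_2 \left(-\tfrac{1}{2}\right)[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{-3/2}\, 2(x_1 - x_2)\cdot 1 \\
&= -X_1(x_1, y_1, z_1, x_2, y_2, z_2).
\end{aligned} \tag{2.5.5}
\]

Similarly $\frac{\partial V}{\partial y_1} = -Y_1$, $\frac{\partial V}{\partial z_1} = -Z_1$, and

\[
\frac{\partial V}{\partial x_2} = -\frac{\partial V}{\partial x_1}, \qquad
\frac{\partial V}{\partial y_2} = -\frac{\partial V}{\partial y_1}, \qquad
\frac{\partial V}{\partial z_2} = -\frac{\partial V}{\partial z_1} \tag{2.5.6}
\]

\[
\implies\quad -\frac{\partial V}{\partial x_2} = -X_1 = X_2, \qquad -\frac{\partial V}{\partial y_2} = -Y_1 = Y_2, \qquad -\frac{\partial V}{\partial z_2} = -Z_1 = Z_2,
\]

so that for $j = 1, 2$

\[
X_j = -\frac{\partial V}{\partial x_j}, \qquad Y_j = -\frac{\partial V}{\partial y_j}, \qquad Z_j = -\frac{\partial V}{\partial z_j} \tag{2.5.7}
\]

as in (2.4.4).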

If $q_1, q_2$ are masses, rather than charges, then $V$ is the gravitational potential,
up to a constant (the universal gravitational constant), and the inverse square
law (2.5.1) is Newton's universal law of gravitational attraction. In (2.5.1) we
have omitted an appropriate constant, which some readers might wish to incorporate
for later application. This is not a problem since one can replace $q_1$, for
example, by a multiple of itself.
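The relations (2.5.7) between the Coulomb force (2.5.1) and the potential (2.5.4) can be spot-checked with finite differences. The following is a minimal sketch; the function names, the sample point, and the step size `h` are illustrative assumptions, not part of the text.

```python
import math


def coulomb_potential(p, q1, q2):
    """V from (2.5.4) at p = (x1, y1, z1, x2, y2, z2), with the constant k = 1."""
    x1, y1, z1, x2, y2, z2 = p
    return q1 * q2 / math.sqrt((x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2)


def force_on_particle_one(p, q1, q2):
    """F^(1) = (X1, Y1, Z1) from (2.5.1) and (2.5.3)."""
    d = [p[i] - p[i + 3] for i in range(3)]
    r_cubed = sum(c * c for c in d) ** 1.5
    return [q1 * q2 * c / r_cubed for c in d]


def numerical_gradient(func, p, h=1e-6):
    """Central-difference approximation of all first partial derivatives."""
    grad = []
    for i in range(len(p)):
        up, down = list(p), list(p)
        up[i] += h
        down[i] -= h
        grad.append((func(up) - func(down)) / (2 * h))
    return grad


def max_mismatch(p, q1, q2):
    """Largest deviation from X_j = -dV/dx_j (and likewise for Y_j, Z_j), j = 1, 2."""
    grad = numerical_gradient(lambda u: coulomb_potential(u, q1, q2), p)
    f1 = force_on_particle_one(p, q1, q2)
    forces = f1 + [-c for c in f1]  # F^(2) = -F^(1)
    return max(abs(f + g) for f, g in zip(forces, grad))
```

For any point of $U$ (distinct particle positions) `max_mismatch` should be close to zero, limited only by the finite-difference error.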



2.6 Hamilton’s Equations of Motion 

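We define the Hamiltonian of our system of $n$ particles moving in $\mathbb{R}^3$ as the total
energy function $H : \mathbb{R}^{3n} \times \mathbb{R}^{3n} \to \mathbb{R}$ given by

\[
H(q^{(1)}, \ldots, q^{(n)}, p^{(1)}, \ldots, p^{(n)}) = \sum_{j=1}^{n}\left[\frac{(p_1^{(j)})^2}{2m_j} + \frac{(p_2^{(j)})^2}{2m_j} + \frac{(p_3^{(j)})^2}{2m_j}\right] + V(q^{(1)}, \ldots, q^{(n)}) \tag{2.6.1}
\]

for $q^{(j)} = (q_1^{(j)}, q_2^{(j)}, q_3^{(j)})$, $p^{(j)} = (p_1^{(j)}, p_2^{(j)}, p_3^{(j)}) \in \mathbb{R}^3$; cf. equations (2.2.14), (2.4.12). Let

\[
\begin{aligned}
\phi(t) &= (\phi_1^{(1)}(t), \phi_2^{(1)}(t), \phi_3^{(1)}(t), \ldots, \phi_1^{(n)}(t), \phi_2^{(n)}(t), \phi_3^{(n)}(t)) && (2.6.2) \\
&= (x_1(t), y_1(t), z_1(t), \ldots, x_n(t), y_n(t), z_n(t)) && (2.6.3)
\end{aligned}
\]

as before, and define

\[
\begin{aligned}
m\phi'(t) &= (m_1(\phi_1^{(1)})'(t), m_1(\phi_2^{(1)})'(t), m_1(\phi_3^{(1)})'(t), \ldots, m_n(\phi_3^{(n)})'(t)) && (2.6.4) \\
&= (m_1\dot x_1(t), m_1\dot y_1(t), m_1\dot z_1(t), \ldots, m_n\dot x_n(t), m_n\dot y_n(t), m_n\dot z_n(t)) && (2.6.5)
\end{aligned}
\]

by (2.4.1). Then

\[
\begin{aligned}
H(\phi(t), m\phi'(t)) &= \sum_{j=1}^{n}\left[\frac{m_j^2\dot x_j(t)^2}{2m_j} + \frac{m_j^2\dot y_j(t)^2}{2m_j} + \frac{m_j^2\dot z_j(t)^2}{2m_j}\right] + V(\phi(t)) \\
&= H(t)
\end{aligned} \tag{2.6.6}
\]

(by (2.6.1) and (2.4.7)), which as we have noted in (2.4.7), (2.4.8) is a constant
for all time $t$.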

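Moreover for $q^{(j)} = (q_1^{(j)}, q_2^{(j)}, q_3^{(j)}) \in \mathbb{R}^3$ we have

\[
\begin{aligned}
\frac{\partial H}{\partial q_1^{(j)}}(\phi(t), m\phi'(t)) &= \frac{\partial V}{\partial x_j}(\phi(t)) = -X_j(\phi(t)) \\
&= -m_j\ddot x_j(t) = -\frac{d}{dt}\left[m_j(\phi_1^{(j)})'(t)\right],
\end{aligned} \tag{2.6.7}
\]

by (2.6.1), (2.4.4) and (2.4.2). Similarly

\[
\frac{\partial H}{\partial q_2^{(j)}}(\phi(t), m\phi'(t)) = -m_j\ddot y_j(t) = -\frac{d}{dt}\left[m_j(\phi_2^{(j)})'(t)\right], \tag{2.6.8}
\]

\[
\frac{\partial H}{\partial q_3^{(j)}}(\phi(t), m\phi'(t)) = -m_j\ddot z_j(t) = -\frac{d}{dt}\left[m_j(\phi_3^{(j)})'(t)\right]. \tag{2.6.9}
\]

On the other hand

\[
\frac{\partial H}{\partial p_1^{(j)}}(\phi(t), m\phi'(t)) = \frac{m_j(\phi_1^{(j)})'(t)}{m_j} = \frac{d}{dt}\left[\phi_1^{(j)}(t)\right]. \tag{2.6.10}
\]

Similarly

\[
\frac{\partial H}{\partial p_2^{(j)}}(\phi(t), m\phi'(t)) = \frac{d}{dt}\left[\phi_2^{(j)}(t)\right], \tag{2.6.11}
\]

\[
\frac{\partial H}{\partial p_3^{(j)}}(\phi(t), m\phi'(t)) = \frac{d}{dt}\left[\phi_3^{(j)}(t)\right]. \tag{2.6.12}
\]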



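Thus we have the following generalization of Theorem 2.1.

Theorem 2.2. For a system of $n$ particles moving in space, let
$\phi^{(j)} = (\phi_1^{(j)}, \phi_2^{(j)}, \phi_3^{(j)}) : \mathbb{R} \to \mathbb{R}^3$ specify the position of the $j$th particle with mass $m_j$. Assume
as above the system is governed by a potential $V : \mathbb{R}^{3n} \to \mathbb{R}$; see (2.4.4). Let
$H : \mathbb{R}^{3n} \times \mathbb{R}^{3n} \to \mathbb{R}$ be the Hamiltonian given by (2.6.1), and let $\phi$, $m\phi'$ be defined
on $\mathbb{R}$ by (2.6.2) and (2.6.4). Then we have that $t \to H(\phi(t), m\phi'(t))$ is a
constant function on $\mathbb{R}$ (conservation of energy along a trajectory), and Hamilton's
equations of motion for the system:

\[
\frac{\partial H}{\partial q_k^{(j)}}(\phi(t), m\phi'(t)) = -\frac{d}{dt}\left[m_j(\phi_k^{(j)})'(t)\right], \qquad
\frac{\partial H}{\partial p_k^{(j)}}(\phi(t), m\phi'(t)) = \frac{d}{dt}\left[\phi_k^{(j)}(t)\right] \tag{2.6.13}
\]

for $k = 1, 2, 3$, $(q^{(1)}, \ldots, q^{(n)}, p^{(1)}, \ldots, p^{(n)}) \in \mathbb{R}^{3n} \times \mathbb{R}^{3n}$ with $q^{(j)} = (q_1^{(j)}, q_2^{(j)}, q_3^{(j)})$,
$p^{(j)} = (p_1^{(j)}, p_2^{(j)}, p_3^{(j)})$, $j = 1, 2, 3, \ldots, n$.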



Examples 

As a simple example consider a ball of mass $m$ thrown vertically up from a
height $z_0$ feet above the ground represented by the $xy$-plane. As we have noted,
a freely falling object experiences a constant acceleration, due to gravity, of magnitude
$g$ = about 32 ft/sec$^2$, assuming that the fall is relatively near the earth's
surface, and that the force exerted on the object due to air resistance is negligible.
This means that the ball will experience a force $F = -mg\mathbf{k}$ by Newton's
second law where $\mathbf{k} = (0, 0, 1)$ is the unit vector along the $z$-axis. Regarding
$F$ as a function $(F_1, F_2, F_3) : \mathbb{R}^3 \to \mathbb{R}^3$, where $F_1(x, y, z) = F_2(x, y, z) = 0$,
$F_3(x, y, z) = -mg$, we clearly have that $F$ is derivable from the potential energy
function $V : \mathbb{R}^3 \to \mathbb{R}$ given by $V(x, y, z) = mgz$: $0 = -\frac{\partial V}{\partial x} = F_1$, $0 = -\frac{\partial V}{\partial y} = F_2$,
$-mg = -\frac{\partial V}{\partial z} = F_3$.

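The corresponding Hamiltonian $H : \mathbb{R}^3 \times \mathbb{R}^3 \to \mathbb{R}$ is given by (2.6.1):

\[
H(q_1, q_2, q_3, p_1, p_2, p_3) = \frac{p_1^2 + p_2^2 + p_3^2}{2m} + mgq_3. \tag{2.6.14}
\]

Let $\phi(t) = (\phi_1(t), \phi_2(t), \phi_3(t))$ specify the position of the ball at time $t$. Then
Hamilton's equations (2.6.13) read

\[
\frac{\partial H}{\partial q_k}(\phi(t), m\phi'(t)) = -\frac{d}{dt}\left[m\phi_k'(t)\right], \qquad
\frac{\partial H}{\partial p_k}(\phi(t), m\phi'(t)) = \frac{d}{dt}\left[\phi_k(t)\right] \tag{2.6.15}
\]

for $k = 1, 2, 3$. Now $\frac{\partial H}{\partial q_k} = 0$ for $k = 1, 2$, $\frac{\partial H}{\partial q_3} = mg$, $\frac{\partial H}{\partial p_k}(q_1, q_2, q_3, p_1, p_2, p_3) = \frac{p_k}{m}$
for $k = 1, 2, 3$ $\implies$

\[
m\phi_k''(t) = 0 \quad \text{for } k = 1, 2, \tag{2.6.16}
\]

\[
m\phi_3''(t) = -mg, \tag{2.6.17}
\]

\[
\phi_k'(t) = \frac{m\phi_k'(t)}{m} \quad \text{for } k = 1, 2, 3. \tag{2.6.18}
\]

Integrating equations (2.6.16), (2.6.17) we obtain the equations of motion of the
ball

\[
\begin{aligned}
\phi_1(t) &= a_1 t + b_1, \\
\phi_2(t) &= a_2 t + b_2, \\
\phi_3(t) &= -\frac{gt^2}{2} + a_3 t + b_3.
\end{aligned} \tag{2.6.19}
\]

Taking initial conditions $\phi(0) = (x_0, y_0, z_0)$, $\phi'(0) = (v_1, v_2, v_3)$ we obtain in the
end

\[
\begin{aligned}
\phi_1(t) &= v_1 t + x_0, \\
\phi_2(t) &= v_2 t + y_0, \\
\phi_3(t) &= -\frac{gt^2}{2} + v_3 t + z_0,
\end{aligned} \tag{2.6.20}
\]

as is well known. The equations in (2.6.20) are independent of the mass $m$.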
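As a numerical cross-check of (2.6.14) through (2.6.20), one can integrate Hamilton's equations for the ball directly and compare with the closed form. The sketch below assumes a leapfrog (kick-drift-kick) scheme and sample initial data chosen purely for illustration; the function names are hypothetical.

```python
def hamilton_rhs(p, m, g=32.0):
    """Right-hand sides of Hamilton's equations for H = |p|^2/(2m) + m*g*q3."""
    dq = [pk / m for pk in p]   # dq_k/dt =  dH/dp_k = p_k / m
    dp = [0.0, 0.0, -m * g]     # dp_k/dt = -dH/dq_k
    return dq, dp


def integrate_ball(q0, v0, m, t_end, steps=1000, g=32.0):
    """Leapfrog integration starting from position q0 and velocity v0."""
    dt = t_end / steps
    q = list(q0)
    p = [m * v for v in v0]     # momentum p = m * phi'
    for _ in range(steps):
        _, dp = hamilton_rhs(p, m, g)
        p = [pk + 0.5 * dt * d for pk, d in zip(p, dp)]   # half kick
        dq, _ = hamilton_rhs(p, m, g)
        q = [qk + dt * d for qk, d in zip(q, dq)]         # drift
        _, dp = hamilton_rhs(p, m, g)
        p = [pk + 0.5 * dt * d for pk, d in zip(p, dp)]   # half kick
    return q


def closed_form(q0, v0, t, g=32.0):
    """Equations of motion (2.6.20)."""
    return [v0[0] * t + q0[0],
            v0[1] * t + q0[1],
            -g * t * t / 2 + v0[2] * t + q0[2]]
```

Running `integrate_ball` with two different masses and the same initial position and velocity gives the same final position, in line with the remark that (2.6.20) does not depend on $m$.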

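In another example consider the potential $V : \mathbb{R}^3 \to \mathbb{R}$ given by $V(x, y, z) = \frac{a}{2}(x^2 + y^2 + z^2)$,
where $a > 0$ is some fixed constant. The corresponding force field
is $F = -\nabla V$: $F(x, y, z) = -a(x, y, z)$. Suppose a particle moves on the surface
of a cylinder of radius $r$, as pictured below, under the influence of this field. What
are its equations of motion?

Figure 2.3: Particle on a cylinder

By (2.6.1) the Hamiltonian $H : \mathbb{R}^3 \times \mathbb{R}^3 \to \mathbb{R}$ is given by

\[
H(q_1, q_2, q_3, p_1, p_2, p_3) = \frac{1}{2m}\sum_{j=1}^{3} p_j^2 + \frac{a}{2}(r^2 + q_3^2) \tag{2.6.21}
\]

as $q_1^2 + q_2^2 = r^2$ on the cylinder. If $\phi(t) = (\phi_1(t), \phi_2(t), \phi_3(t))$ denotes the position
of the particle (of mass $m$) at time $t$, then Hamilton's equations of motion are as
in (2.6.13), where now $\frac{\partial H}{\partial q_k} = 0$ for $k = 1, 2$, $\frac{\partial H}{\partial q_3} = aq_3$, $\frac{\partial H}{\partial p_k} = \frac{p_k}{m}$ for $k = 1, 2, 3$
$\implies$ $\phi_k''(t) = 0$ for $k = 1, 2$, $m\phi_3''(t) = -a\phi_3(t)$. Integrating, we get

\[
\begin{aligned}
\phi_k(t) &= a_k t + b_k, \quad k = 1, 2, \\
\phi_3(t) &= a\cos(\omega t) + b\sin(\omega t) \quad \text{for } \omega^2 = \frac{a}{m}, \text{ since } a > 0,
\end{aligned} \tag{2.6.22}
\]

where the coefficients $a$, $b$ in $\phi_3$ are constants of integration (not the constant $a$ of the potential).
Since $\phi(t)$ is a point on the cylinder we must have $(a_1 t + b_1)^2 + (a_2 t + b_2)^2 = r^2$
$\implies (a_1^2 + a_2^2)t^2 + 2(a_1 b_1 + a_2 b_2)t + b_1^2 + b_2^2 - r^2 = 0 \implies a_1, a_2 = 0$, $b_1^2 + b_2^2 = r^2$.
Therefore $\phi_1(t) = b_1$, $\phi_2(t) = b_2$ where $b_1^2 + b_2^2 = r^2$. Also the $z$-component of
the motion is simple harmonic motion with frequency $\nu = \omega/2\pi$; cf. Section 2.2.
We can write $\phi_3(t) = C\cos(\omega t - \delta)$. Namely $a = C\cos\delta$, $b = C\sin\delta$ for some
$\delta$, for $C = +\sqrt{a^2 + b^2}$: $(a/C)^2 + (b/C)^2 = 1$. Then indeed

\[
\begin{aligned}
C\cos(\omega t - \delta) &= C[\cos(\omega t)\cos\delta + \sin(\omega t)\sin\delta] \\
&= a\cos\omega t + b\sin\omega t = \phi_3(t).
\end{aligned} \tag{2.6.23}
\]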
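The passage from the pair of coefficients $(a, b)$ to the amplitude $C$ and phase $\delta$ in (2.6.23) is easy to compute. This is a small sketch; choosing $\delta$ via `math.atan2` is one convenient assumption (any $\delta$ differing by a multiple of $2\pi$ works equally well), and it requires $(a, b) \ne (0, 0)$.

```python
import math


def amplitude_phase(a, b):
    """Return (C, delta) with a = C cos(delta), b = C sin(delta), C = +sqrt(a^2 + b^2)."""
    c = math.hypot(a, b)
    delta = math.atan2(b, a)
    return c, delta


def phi3_sum_form(t, a, b, omega):
    """phi_3(t) = a cos(omega t) + b sin(omega t)."""
    return a * math.cos(omega * t) + b * math.sin(omega * t)


def phi3_phase_form(t, a, b, omega):
    """phi_3(t) = C cos(omega t - delta)."""
    c, delta = amplitude_phase(a, b)
    return c * math.cos(omega * t - delta)
```

Both forms should agree at every time $t$, which is the content of the identity (2.6.23).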



2.7 Lagrange’s Equations of Motion 

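We have enough notation established to easily derive Lagrange's form of the equations
of motion of a system of $n$ particles in $\mathbb{R}^3$ under the influence of a potential
$V : \mathbb{R}^{3n} \to \mathbb{R}$. Define the Lagrangian $L : \mathbb{R}^{3n} \times \mathbb{R}^{3n} \to \mathbb{R}$ of the system by

\[
\begin{aligned}
L(q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) &\overset{\text{def}}{=} T(v^{(1)}, \ldots, v^{(n)}) - V(q^{(1)}, \ldots, q^{(n)}) \\
&= \frac{1}{2}\sum_{j=1}^{n} m_j\left((v_1^{(j)})^2 + (v_2^{(j)})^2 + (v_3^{(j)})^2\right) - V(q^{(1)}, \ldots, q^{(n)})
\end{aligned} \tag{2.7.1}
\]

by (2.4.11), where as before $q^{(j)} = (q_1^{(j)}, q_2^{(j)}, q_3^{(j)}) \in \mathbb{R}^3$.
Then

\[
\frac{\partial L}{\partial v_k^{(j)}}(q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) = m_j v_k^{(j)} \tag{2.7.2}
\]

and by (2.4.4)

\[
\frac{\partial L}{\partial q_k^{(j)}}(q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) =
\begin{cases}
X_j(q^{(1)}, \ldots, q^{(n)}) & \text{for } k = 1 \\
Y_j(q^{(1)}, \ldots, q^{(n)}) & \text{for } k = 2 \\
Z_j(q^{(1)}, \ldots, q^{(n)}) & \text{for } k = 3.
\end{cases} \tag{2.7.3}
\]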



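Then for $\phi(t)$ defined by (2.6.2), (2.6.3) and $\phi'(t)$ defined accordingly

\[
\frac{\partial L}{\partial v_k^{(j)}}(\phi(t), \phi'(t)) = m_j(\phi_k^{(j)})'(t) \tag{2.7.4}
\]

and

\[
\frac{\partial L}{\partial q_k^{(j)}}(\phi(t), \phi'(t)) =
\begin{cases}
m_j\ddot x_j(t) & \text{for } k = 1 \\
m_j\ddot y_j(t) & \text{for } k = 2 \\
m_j\ddot z_j(t) & \text{for } k = 3
\end{cases} \tag{2.7.5}
\]

by (2.4.2), (2.4.3), and (2.7.3). That is, by (2.4.1)

\[
\frac{\partial L}{\partial q_k^{(j)}}(\phi(t), \phi'(t)) = m_j(\phi_k^{(j)})''(t). \tag{2.7.6}
\]

By (2.7.4), (2.7.6) we therefore obtain

Theorem 2.3 (Lagrange's equations of motion). As before, let $\phi(t)$ be given
by (2.6.2) where $(x_j(t), y_j(t), z_j(t))$ specifies the position of the $j$th particle
of a system of $n$ particles moving in $\mathbb{R}^3$. Let $L$ be the Lagrangian of the
system, given by (2.7.1). Then

\[
\frac{d}{dt}\frac{\partial L}{\partial v_k^{(j)}}(\phi(t), \phi'(t)) - \frac{\partial L}{\partial q_k^{(j)}}(\phi(t), \phi'(t)) = 0 \tag{2.7.7}
\]

for $k = 1, 2, 3$, $1 \le j \le n$.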



Example 

Suppose for example a particle of mass $m$ moves in $\mathbb{R}^3$ under the influence
of the field $F : \mathbb{R}^3 \to \mathbb{R}^3$ given by $F(x, y, z) = (mx, my, mz)$; here $n = 1$.
Then $F = -\nabla V$ for $V(x, y, z) = -\frac{m}{2}(x^2 + y^2 + z^2)$. Equations (2.3.1) become
$m\phi_j''(t) = m\phi_j(t)$, $1 \le j \le 3$, so that the equations of motion are

\[
\phi_j(t) = a_j\cosh t + b_j\sinh t, \quad 1 \le j \le 3. \tag{2.7.8}
\]

The velocity vector is

\[
\begin{aligned}
v(t) &= (\phi_1'(t), \phi_2'(t), \phi_3'(t)) \\
&= (a_1\sinh t + b_1\cosh t,\ a_2\sinh t + b_2\cosh t,\ a_3\sinh t + b_3\cosh t).
\end{aligned} \tag{2.7.9}
\]

Equations (2.7.8) can easily be derived as well by an application of Theorem 2.3.
Namely, by (2.7.1)

\[
L(q_1, q_2, q_3, v_1, v_2, v_3) = \frac{m}{2}\sum_{j=1}^{3} v_j^2 + \frac{m}{2}\sum_{j=1}^{3} q_j^2. \tag{2.7.10}
\]

Hence for $k = 1, 2, 3$

\[
\frac{\partial L}{\partial v_k} = mv_k, \qquad \frac{\partial L}{\partial q_k} = mq_k. \tag{2.7.11}
\]

Differentiating the first equation in (2.7.11) with respect to $t$ we get indeed
$m\phi_k''(t) - m\phi_k(t) = 0$ by Theorem 2.3, as before.
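A quick numerical sanity check that the functions in (2.7.8) satisfy $\phi'' = \phi$ can be made with a second-order central difference. The coefficients and step size below are arbitrary illustrative choices.

```python
import math


def phi(t, a, b):
    """One component of the solution (2.7.8): a cosh t + b sinh t."""
    return a * math.cosh(t) + b * math.sinh(t)


def second_derivative(func, t, h=1e-3):
    """Central-difference approximation of func''(t)."""
    return (func(t + h) - 2.0 * func(t) + func(t - h)) / (h * h)


def equation_residual(t, a, b):
    """phi''(t) - phi(t), which should vanish for a solution of m phi'' = m phi."""
    return second_derivative(lambda s: phi(s, a, b), t) - phi(t, a, b)
```

The residual is not exactly zero because of the finite-difference error, but it should be tiny compared with $\phi(t)$ itself.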

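We can relate the Hamiltonian and the Lagrangian: By (2.7.1), (2.7.2), at a
point $(q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) \in \mathbb{R}^{3n} \times \mathbb{R}^{3n}$

\[
\begin{aligned}
&\sum_{j=1}^{n}\left[\frac{\partial L}{\partial v_1^{(j)}}v_1^{(j)} + \frac{\partial L}{\partial v_2^{(j)}}v_2^{(j)} + \frac{\partial L}{\partial v_3^{(j)}}v_3^{(j)}\right] - L \\
&\quad = \sum_{j=1}^{n}\left[m_j(v_1^{(j)})^2 + m_j(v_2^{(j)})^2 + m_j(v_3^{(j)})^2\right]
- \frac{1}{2}\sum_{j=1}^{n} m_j\left[(v_1^{(j)})^2 + (v_2^{(j)})^2 + (v_3^{(j)})^2\right] + V \\
&\quad = \sum_{j=1}^{n}\left[\frac{(m_j v_1^{(j)})^2}{2m_j} + \frac{(m_j v_2^{(j)})^2}{2m_j} + \frac{(m_j v_3^{(j)})^2}{2m_j}\right] + V \\
&\quad = H(q^{(1)}, \ldots, q^{(n)}, m_1 v^{(1)}, \ldots, m_n v^{(n)}),
\end{aligned} \tag{2.7.12}
\]

where $m_j v^{(j)} = (m_j v_1^{(j)}, m_j v_2^{(j)}, m_j v_3^{(j)})$. In particular since the potential $V$ does not
appear in the final form of (2.7.12) which relates $H$ to $L$, one can use this formula
as motivation for a definition of $H$, given $L$, in situations (more generally)
when $V$ might not even exist. We will encounter such a situation, for example, in
Chapter 8, where the motion of a charged particle in an electromagnetic field is
studied. Also note that by (2.7.2), (2.7.3) given a Lagrangian $L$ in general we can
regard the $\frac{\partial L}{\partial v_k^{(j)}}$ as "generalized momenta," and the $\frac{\partial L}{\partial q_k^{(j)}}$ as "generalized forces."

Thus, for example, we could define the momentum of some $j$th particle, for
some abstract Lagrangian $L$, by

\[
p^{(j)} \overset{\text{def}}{=} \left(\frac{\partial L}{\partial v_1^{(j)}}, \frac{\partial L}{\partial v_2^{(j)}}, \frac{\partial L}{\partial v_3^{(j)}}\right). \tag{2.7.13}
\]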
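The relation (2.7.12) between $H$ and $L$ can be illustrated numerically for a single particle. In the sketch below the generalized momenta $\partial L/\partial v_k$ are approximated by central differences rather than computed symbolically; the potential $V(q) = mgq_3$ from the earlier ball example and the sample point are assumptions made only for the demonstration.

```python
def lagrangian(q, v, m, potential):
    """L = (m/2)|v|^2 - V(q) for one particle, as in (2.7.1) with n = 1."""
    return 0.5 * m * sum(c * c for c in v) - potential(q)


def hamiltonian(q, p, m, potential):
    """H = |p|^2/(2m) + V(q), as in (2.6.1) with n = 1."""
    return sum(c * c for c in p) / (2.0 * m) + potential(q)


def legendre_transform(q, v, m, potential, h=1e-6):
    """sum_k (dL/dv_k) v_k - L, with dL/dv_k from central differences."""
    total = 0.0
    for k in range(len(v)):
        up, down = list(v), list(v)
        up[k] += h
        down[k] -= h
        dL_dvk = (lagrangian(q, up, m, potential)
                  - lagrangian(q, down, m, potential)) / (2.0 * h)
        total += dL_dvk * v[k]
    return total - lagrangian(q, v, m, potential)


def ball_potential(q, m=2.0, g=32.0):
    """Illustrative potential V(q) = m g q_3."""
    return m * g * q[2]
```

Evaluating `legendre_transform(q, v, m, V)` and `hamiltonian(q, [m * c for c in v], m, V)` at the same point should give the same number, as (2.7.12) asserts.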



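The reader familiar with the calculus of variations recognizes equations (2.7.7)
as Euler-Lagrange equations: equations satisfied by a local extremum $\phi(t)$ of a
suitable functional $S$. Namely, given two fixed values of time $t_1, t_2$, say $t_1 < t_2$, let
$C^1[t_1, t_2]$ be the space of continuously differentiable functions on $[t_1, t_2]$. Equip
the Cartesian product $(C^1[t_1, t_2])^m$ with the norm

\[
\|\phi\| = \sum_{j=1}^{m}\left(\|\phi_j\|_\infty + \|\phi_j'\|_\infty\right) \tag{2.7.14}
\]

where $\|\cdot\|_\infty$ is the sup norm. Given a fixed continuous function $\mathcal{L} : \mathbb{R} \times \mathbb{R}^m \times \mathbb{R}^m \to \mathbb{R}$
of class $C^2$ in the variables $(x, y) \in \mathbb{R}^m \times \mathbb{R}^m$ on $[t_1, t_2] \times \mathbb{R}^m \times \mathbb{R}^m$, define a
functional

\[
S : (C^1[t_1, t_2])^m \to \mathbb{R} \tag{2.7.15}
\]

by

\[
S(\phi) = \int_{t_1}^{t_2} \mathcal{L}(t, \phi_1(t), \ldots, \phi_m(t), \phi_1'(t), \ldots, \phi_m'(t))\, dt \tag{2.7.16}
\]

which we also write as

\[
\int_{t_1}^{t_2} \mathcal{L}(t, \phi(t), \phi'(t))\, dt.
\]

Fix two points $A = (A_1, \ldots, A_m)$, $B = (B_1, \ldots, B_m) \in \mathbb{R}^m$ and let

\[
\Omega_{A,B} = \{\phi \in (C^1[t_1, t_2])^m \mid \phi_j(t_1) = A_j,\ \phi_j(t_2) = B_j \text{ for } 1 \le j \le m\}. \tag{2.7.17}
\]

Also let $(t, x_1, \ldots, x_m, y_1, \ldots, y_m)$ denote the typical point in $\mathbb{R} \times \mathbb{R}^m \times \mathbb{R}^m$. Given
the above assumptions on $\mathcal{L}$, one can show that

\[
t \to \frac{\partial\mathcal{L}}{\partial y_j}(t, \phi(t), \phi'(t))
\]

is automatically continuously differentiable on $[t_1, t_2]$, for $\phi \in (C^1[t_1, t_2])^m$, and
that if $S$ has a local extremum (maximum or minimum) at a point $\phi \in \Omega_{A,B}$, then
$\phi$ satisfies the Euler-Lagrange equations

\[
\frac{d}{dt}\frac{\partial\mathcal{L}}{\partial y_j}(t, \phi(t), \phi'(t)) = \frac{\partial\mathcal{L}}{\partial x_j}(t, \phi(t), \phi'(t)), \quad 1 \le j \le m \tag{2.7.18}
\]

on $[t_1, t_2]$.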

As an example, for the system of $n$ particles moving in $\mathbb{R}^3$ as above, choose
$m = 3n$ and define $\mathcal{L} : \mathbb{R} \times \mathbb{R}^{3n} \times \mathbb{R}^{3n} \to \mathbb{R}$ by

\[
\begin{aligned}
\mathcal{L}(t, q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) &= L(q^{(1)}, \ldots, q^{(n)}, v^{(1)}, \ldots, v^{(n)}) \\
&= T(v^{(1)}, \ldots, v^{(n)}) - V(q^{(1)}, \ldots, q^{(n)}),
\end{aligned} \tag{2.7.19}
\]

as in (2.7.1). The corresponding action functional or action integral $S$ in (2.7.16)
is given by

\[
S(\phi) = \int_{t_1}^{t_2} L(\phi(t), \phi'(t))\, dt \tag{2.7.20}
\]

with $\phi \in (C^1[t_1, t_2])^{3n}$. In this case the Euler-Lagrange equations
(2.7.18) coincide with the Lagrange equations (2.7.7), and we have at hand
Hamilton's principle: Among possible paths $\phi(t) \in (C^1[t_1, t_2])^{3n}$ subject to
$\phi_k^{(j)}(t_1) = A_k^{(j)}$, $\phi_k^{(j)}(t_2) = B_k^{(j)}$, $1 \le k \le 3$, $1 \le j \le n$, where $A = (A^{(1)}, \ldots, A^{(n)})$,
$B = (B^{(1)}, \ldots, B^{(n)}) \in \mathbb{R}^{3n}$ are fixed, over a fixed interval of time $t_1 \le t \le t_2$ the
actual motion of the system is along a path $\phi(t)$ which renders a stationary value
of the action integral $S$ in (2.7.20); i.e., the motion is described by a path $\phi(t)$
which is a local maximum or minimum of $S$.
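Hamilton's principle can be illustrated on the vertical motion of the ball from Section 2.6. The sketch below discretizes the action for $L = \tfrac{m}{2}\dot z^2 - mgz$ on $[0, 1]$ with $z(0) = z(1) = 0$ and compares the true path with a perturbed path having the same endpoints. The discretization (piecewise-linear paths, midpoint heights) and the perturbation $\varepsilon\sin(\pi t)$ are assumptions made for this example only.

```python
import math


def discrete_action(path, dt, m=1.0, g=32.0):
    """Approximate S = integral of (m/2) z'^2 - m g z along a sampled path."""
    total = 0.0
    for z0, z1 in zip(path, path[1:]):
        velocity = (z1 - z0) / dt
        height = 0.5 * (z0 + z1)
        total += (0.5 * m * velocity ** 2 - m * g * height) * dt
    return total


def sample_paths(eps, n=1000, g=32.0):
    """Return dt, the true path z(t) = -g t^2/2 + g t/2, and a perturbed path."""
    dt = 1.0 / n
    times = [i * dt for i in range(n + 1)]
    true_path = [-0.5 * g * t * t + 0.5 * g * t for t in times]
    # eps * sin(pi t) vanishes at t = 0 and t = 1, so the endpoints stay fixed.
    perturbed = [z + eps * math.sin(math.pi * t) for z, t in zip(true_path, times)]
    return dt, true_path, perturbed
```

For this particular Lagrangian the true path is in fact a minimum: the action of the perturbed path exceeds that of the true path by approximately $\varepsilon^2\pi^2/4$ (for $m = 1$), whatever the sign of $\varepsilon$.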



2.8 Center of Mass Coordinates 

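Consider again $n = 2$ particles in $\mathbb{R}^3$ with masses $m_1, m_2$. For a point $(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6$
define $(R_1, R_2, R_3)$ by

\[
\begin{aligned}
R_1 &= x_1 - \frac{m_2}{m_1 + m_2}(x_1 - x_2), \\
R_2 &= y_1 - \frac{m_2}{m_1 + m_2}(y_1 - y_2), \\
R_3 &= z_1 - \frac{m_2}{m_1 + m_2}(z_1 - z_2).
\end{aligned} \tag{2.8.1}
\]

That is, for $R = (R_1, R_2, R_3)$ we have

\[
\begin{aligned}
(m_1 + m_2)R &= (m_1 + m_2)\left[(x_1, y_1, z_1) - \frac{m_2}{m_1 + m_2}\big((x_1, y_1, z_1) - (x_2, y_2, z_2)\big)\right] \\
&= m_1(x_1, y_1, z_1) + m_2(x_2, y_2, z_2);
\end{aligned} \tag{2.8.2}
\]

\[
R_1 = \frac{m_1 x_1 + m_2 x_2}{m_1 + m_2}, \qquad R_2 = \frac{m_1 y_1 + m_2 y_2}{m_1 + m_2}, \qquad R_3 = \frac{m_1 z_1 + m_2 z_2}{m_1 + m_2}. \tag{2.8.3}
\]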



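We obtain a map $\Phi : \mathbb{R}^6 \to \mathbb{R}^6$ by setting $\Phi(x_1, y_1, z_1, x_2, y_2, z_2) = (x_1 - x_2, y_1 - y_2, z_1 - z_2, R_1, R_2, R_3)$.
Write $\Phi = (g_1, g_2, g_3, g_4, g_5, g_6)$ where $g_j : \mathbb{R}^6 \to \mathbb{R}$ are
maps. Thus $g_1(x_1, y_1, z_1, x_2, y_2, z_2) = x_1 - x_2 \implies$

\[
\begin{aligned}
\frac{\partial g_1}{\partial x_1} &= 1 = -\frac{\partial g_1}{\partial x_2} && \text{with other first partials of } g_1 = 0, \\
\frac{\partial g_2}{\partial y_1} &= 1 = -\frac{\partial g_2}{\partial y_2} && \text{with other first partials of } g_2 = 0, \\
\frac{\partial g_3}{\partial z_1} &= 1 = -\frac{\partial g_3}{\partial z_2} && \text{with other first partials of } g_3 = 0,
\end{aligned} \tag{2.8.4}
\]

and

\[
\begin{aligned}
\frac{\partial g_4}{\partial x_1} &= 1 - \frac{m_2}{m_1 + m_2} = \frac{m_1}{m_1 + m_2}, & \frac{\partial g_4}{\partial x_2} &= \frac{m_2}{m_1 + m_2}, && \text{with other first partials of } g_4 = 0, \\
\frac{\partial g_5}{\partial y_1} &= 1 - \frac{m_2}{m_1 + m_2} = \frac{m_1}{m_1 + m_2}, & \frac{\partial g_5}{\partial y_2} &= \frac{m_2}{m_1 + m_2}, && \text{with other first partials of } g_5 = 0, \\
\frac{\partial g_6}{\partial z_1} &= 1 - \frac{m_2}{m_1 + m_2} = \frac{m_1}{m_1 + m_2}, & \frac{\partial g_6}{\partial z_2} &= \frac{m_2}{m_1 + m_2}, && \text{with other first partials of } g_6 = 0.
\end{aligned} \tag{2.8.5}
\]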



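Let $f : \mathbb{R}^6 \to \mathbb{R}$ have continuous first and second partial derivatives. Define
$F : \mathbb{R}^6 \to \mathbb{R}$ by $F(a) = f(g_1(a), \ldots, g_6(a)) = f(\Phi(a))$, $a \in \mathbb{R}^6$. By the chain
rule

\[
(D_i F)(a) = \sum_{j=1}^{6} (D_j f)(\Phi(a))\,(D_i g_j)(a) \quad \text{for } 1 \le i \le 6. \tag{2.8.6}
\]

Using the preceding formulas for the partial derivatives $D_i g_j$ we get

\[
\begin{aligned}
\frac{\partial F}{\partial x_1}(a) &= \frac{\partial f}{\partial x_1}(\Phi(a)) + \frac{\partial f}{\partial x_2}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F}{\partial y_1}(a) &= \frac{\partial f}{\partial y_1}(\Phi(a)) + \frac{\partial f}{\partial y_2}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F}{\partial z_1}(a) &= \frac{\partial f}{\partial z_1}(\Phi(a)) + \frac{\partial f}{\partial z_2}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F}{\partial x_2}(a) &= -\frac{\partial f}{\partial x_1}(\Phi(a)) + \frac{\partial f}{\partial x_2}(\Phi(a))\frac{m_2}{m_1 + m_2}, \\
\frac{\partial F}{\partial y_2}(a) &= -\frac{\partial f}{\partial y_1}(\Phi(a)) + \frac{\partial f}{\partial y_2}(\Phi(a))\frac{m_2}{m_1 + m_2}, \\
\frac{\partial F}{\partial z_2}(a) &= -\frac{\partial f}{\partial z_1}(\Phi(a)) + \frac{\partial f}{\partial z_2}(\Phi(a))\frac{m_2}{m_1 + m_2}.
\end{aligned} \tag{2.8.7}
\]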

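Replace $f$ by $\frac{\partial f}{\partial x_1}$, $\frac{\partial f}{\partial x_2}$ (since $f$ has continuous second partial derivatives). That is,
let $F_1 = \frac{\partial f}{\partial x_1} \circ \Phi$, $F_4 = \frac{\partial f}{\partial x_2} \circ \Phi$ to get by (2.8.7)

\[
\begin{aligned}
\frac{\partial F_1}{\partial x_1}(a) &= \frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2\partial x_1}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F_4}{\partial x_1}(a) &= \frac{\partial^2 f}{\partial x_1\partial x_2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
\implies \frac{\partial^2 F}{\partial x_1^2}(a) &= \frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2\partial x_1}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
&\quad + \left[\frac{\partial^2 f}{\partial x_1\partial x_2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2}\right]\frac{m_1}{m_1 + m_2}.
\end{aligned} \tag{2.8.8}
\]

Similarly let $F_2 = \frac{\partial f}{\partial y_1} \circ \Phi$, $F_5 = \frac{\partial f}{\partial y_2} \circ \Phi$:

\[
\begin{aligned}
\frac{\partial F_2}{\partial y_1}(a) &= \frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2\partial y_1}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F_5}{\partial y_1}(a) &= \frac{\partial^2 f}{\partial y_1\partial y_2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
\implies \frac{\partial^2 F}{\partial y_1^2}(a) &= \frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2\partial y_1}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
&\quad + \left[\frac{\partial^2 f}{\partial y_1\partial y_2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2}\right]\frac{m_1}{m_1 + m_2}.
\end{aligned} \tag{2.8.9}
\]

Also let $F_3 = \frac{\partial f}{\partial z_1} \circ \Phi$, $F_6 = \frac{\partial f}{\partial z_2} \circ \Phi$:

\[
\begin{aligned}
\frac{\partial F_3}{\partial z_1}(a) &= \frac{\partial^2 f}{\partial z_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2\partial z_1}(\Phi(a))\frac{m_1}{m_1 + m_2}, \\
\frac{\partial F_6}{\partial z_1}(a) &= \frac{\partial^2 f}{\partial z_1\partial z_2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
\implies \frac{\partial^2 F}{\partial z_1^2}(a) &= \frac{\partial^2 f}{\partial z_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2\partial z_1}(\Phi(a))\frac{m_1}{m_1 + m_2} \\
&\quad + \left[\frac{\partial^2 f}{\partial z_1\partial z_2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\frac{m_1}{m_1 + m_2}\right]\frac{m_1}{m_1 + m_2}.
\end{aligned} \tag{2.8.10}
\]

Since $\frac{\partial F}{\partial x_2} = -\frac{\partial f}{\partial x_1} \circ \Phi + \frac{\partial f}{\partial x_2} \circ \Phi\,\frac{m_2}{m_1 + m_2}$,
we get

\[
\frac{\partial^2 F}{\partial x_2^2}(a) = -\frac{\partial F_1}{\partial x_2}(a) + \frac{\partial F_4}{\partial x_2}(a)\frac{m_2}{m_1 + m_2}, \tag{2.8.11}
\]

where

\[
\begin{aligned}
\frac{\partial F_1}{\partial x_2}(a) &= -\frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2\partial x_1}(\Phi(a))\frac{m_2}{m_1 + m_2}, \\
\frac{\partial F_4}{\partial x_2}(a) &= -\frac{\partial^2 f}{\partial x_1\partial x_2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2}.
\end{aligned} \tag{2.8.12}
\]

That is,

\[
\begin{aligned}
\frac{\partial^2 F}{\partial x_2^2}(a) &= \frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) - \frac{\partial^2 f}{\partial x_2\partial x_1}(\Phi(a))\frac{m_2}{m_1 + m_2} \\
&\quad + \left[-\frac{\partial^2 f}{\partial x_1\partial x_2}(\Phi(a)) + \frac{\partial^2 f}{\partial x_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2}\right]\frac{m_2}{m_1 + m_2}.
\end{aligned} \tag{2.8.13}
\]

By (2.8.7)

\[
\begin{aligned}
\frac{\partial F_2}{\partial y_2}(a) &= -\frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2\partial y_1}(\Phi(a))\frac{m_2}{m_1 + m_2}, \\
\frac{\partial F_5}{\partial y_2}(a) &= -\frac{\partial^2 f}{\partial y_1\partial y_2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2} \\
\implies \frac{\partial^2 F}{\partial y_2^2}(a) &= \frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) - \frac{\partial^2 f}{\partial y_2\partial y_1}(\Phi(a))\frac{m_2}{m_1 + m_2} \\
&\quad + \left[-\frac{\partial^2 f}{\partial y_1\partial y_2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2}\right]\frac{m_2}{m_1 + m_2}.
\end{aligned} \tag{2.8.14}
\]

Finally,

\[
\begin{aligned}
\frac{\partial F_3}{\partial z_2}(a) &= -\frac{\partial^2 f}{\partial z_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2\partial z_1}(\Phi(a))\frac{m_2}{m_1 + m_2}, \\
\frac{\partial F_6}{\partial z_2}(a) &= -\frac{\partial^2 f}{\partial z_1\partial z_2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2} \\
\implies \frac{\partial^2 F}{\partial z_2^2}(a) &= \frac{\partial^2 f}{\partial z_1^2}(\Phi(a)) - \frac{\partial^2 f}{\partial z_2\partial z_1}(\Phi(a))\frac{m_2}{m_1 + m_2} \\
&\quad + \left[-\frac{\partial^2 f}{\partial z_1\partial z_2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\frac{m_2}{m_1 + m_2}\right]\frac{m_2}{m_1 + m_2}.
\end{aligned} \tag{2.8.15}
\]

Therefore,

\[
\begin{aligned}
&\frac{1}{m_1}\left[\frac{\partial^2 F}{\partial x_1^2}(a) + \frac{\partial^2 F}{\partial y_1^2}(a) + \frac{\partial^2 F}{\partial z_1^2}(a)\right]
+ \frac{1}{m_2}\left[\frac{\partial^2 F}{\partial x_2^2}(a) + \frac{\partial^2 F}{\partial y_2^2}(a) + \frac{\partial^2 F}{\partial z_2^2}(a)\right] \\
&\quad = \left(\frac{1}{m_1} + \frac{1}{m_2}\right)\left[\frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_1^2}(\Phi(a))\right] \\
&\qquad + \frac{1}{m_1 + m_2}\left[\frac{\partial^2 f}{\partial x_2^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\right].
\end{aligned} \tag{2.8.16}
\]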



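We have assumed $f : \mathbb{R}^6 \to \mathbb{R}$, but we could have taken the domain of $f$ to be
any open subset $D$ of $\mathbb{R}^6$.

Proposition 2.1. More generally suppose $W, D \subset \mathbb{R}^6$ are open sets and $\Phi : W \to D$,
where $\Phi$ is defined following (2.8.3). Let $f : D \to \mathbb{R}$ have continuous
first and second partial derivatives. Define $F$ on $W$ by $F(a) = f(\Phi(a))$ for
$a \in W$. Then formula (2.8.16) holds for $a \in W$:

\[
\begin{aligned}
&\frac{1}{m_1}\left[\frac{\partial^2 F}{\partial x_1^2}(a) + \frac{\partial^2 F}{\partial y_1^2}(a) + \frac{\partial^2 F}{\partial z_1^2}(a)\right]
+ \frac{1}{m_2}\left[\frac{\partial^2 F}{\partial x_2^2}(a) + \frac{\partial^2 F}{\partial y_2^2}(a) + \frac{\partial^2 F}{\partial z_2^2}(a)\right] \\
&\quad = \left(\frac{1}{m_1} + \frac{1}{m_2}\right)\left[\frac{\partial^2 f}{\partial x_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_1^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_1^2}(\Phi(a))\right] \\
&\qquad + \frac{1}{m_1 + m_2}\left[\frac{\partial^2 f}{\partial x_2^2}(\Phi(a)) + \frac{\partial^2 f}{\partial y_2^2}(\Phi(a)) + \frac{\partial^2 f}{\partial z_2^2}(\Phi(a))\right].
\end{aligned} \tag{2.8.17}
\]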



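Proposition 2.2. $\Phi : \mathbb{R}^6 \to \mathbb{R}^6$ is onto and one-to-one.

Proof. If $a = (a_1, a_2, a_3, a_4, a_5, a_6) \in \mathbb{R}^6$ is given, set

\[
\begin{aligned}
x_1 &= \frac{m_2}{m_1 + m_2}a_1 + a_4, & x_2 &= x_1 - a_1, \\
y_1 &= \frac{m_2}{m_1 + m_2}a_2 + a_5, & y_2 &= y_1 - a_2, \\
z_1 &= \frac{m_2}{m_1 + m_2}a_3 + a_6, & z_2 &= z_1 - a_3.
\end{aligned} \tag{2.8.18}
\]

Then $(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6$ such that $\Phi(x_1, y_1, z_1, x_2, y_2, z_2) = a$. If $\Phi(x) = \Phi(x')$,
$x, x' \in \mathbb{R}^6$, then

\[
\begin{aligned}
x_1 - x_2 &= x_1' - x_2', & y_1 - y_2 &= y_1' - y_2', & z_1 - z_2 &= z_1' - z_2', \\
R_1 &= R_1', & R_2 &= R_2', & \text{and } R_3 &= R_3';
\end{aligned} \tag{2.8.19}
\]

i.e., $R_1 = R_1' \implies$

\[
x_1 - \frac{m_2}{m_1 + m_2}(x_1 - x_2) = x_1' - \frac{m_2}{m_1 + m_2}(x_1' - x_2') \implies x_1 = x_1' \tag{2.8.20}
\]

since $x_1 - x_2 = x_1' - x_2'$.
Similarly $R_2 = R_2'$ and $y_1 - y_2 = y_1' - y_2' \implies y_1 = y_1'$; $R_3 = R_3'$ and $z_1 - z_2 = z_1' - z_2' \implies z_1 = z_1'$.
Then $x_2 = x_2'$, $y_2 = y_2'$, $z_2 = z_2'$ by (2.8.19) $\implies x = x'$.

Corollary 2.1. $\Phi : \mathbb{R}^6 \to \mathbb{R}^6$ is a diffeomorphism. For $a = (a_1, a_2, a_3, a_4, a_5, a_6) \in \mathbb{R}^6$,
$\Phi^{-1}(a) = (x_1, y_1, z_1, x_2, y_2, z_2)$ with

\[
\begin{aligned}
x_1 &= \frac{m_2}{m_1 + m_2}a_1 + a_4, & x_2 &= x_1 - a_1, \\
y_1 &= \frac{m_2}{m_1 + m_2}a_2 + a_5, & y_2 &= y_1 - a_2, \\
z_1 &= \frac{m_2}{m_1 + m_2}a_3 + a_6, & z_2 &= z_1 - a_3.
\end{aligned} \tag{2.8.21}
\]

□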
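The inverse formulas (2.8.21) can be checked by a round trip through $\Phi$ and $\Phi^{-1}$. This is a minimal sketch with hypothetical function names and arbitrary sample values.

```python
def to_center_of_mass(point, m1, m2):
    """Phi: (x1, y1, z1, x2, y2, z2) -> (x1-x2, y1-y2, z1-z2, R1, R2, R3)."""
    x1, y1, z1, x2, y2, z2 = point
    total = m1 + m2
    return [x1 - x2, y1 - y2, z1 - z2,
            (m1 * x1 + m2 * x2) / total,
            (m1 * y1 + m2 * y2) / total,
            (m1 * z1 + m2 * z2) / total]


def from_center_of_mass(a, m1, m2):
    """Phi^{-1} as given by (2.8.21)."""
    a1, a2, a3, a4, a5, a6 = a
    weight = m2 / (m1 + m2)
    x1 = weight * a1 + a4
    y1 = weight * a2 + a5
    z1 = weight * a3 + a6
    return [x1, y1, z1, x1 - a1, y1 - a2, z1 - a3]
```

Applying one map after the other, in either order, should reproduce the starting point up to rounding.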
The passage from the coordinates $(x_1, y_1, z_1, x_2, y_2, z_2)$ to the coordinates
$(x_1 - x_2, y_1 - y_2, z_1 - z_2, R_1, R_2, R_3)$, called Jacobi or center of mass coordinates, is very useful
in dealing with Coulomb potentials, as we shall see in the study of the hydrogen
atom, for example. Formula (2.8.17) will permit the transformation of
Schrödinger's differential operator.

One can formulate classical mechanics, as is well known, in the general framework
of a cotangent bundle over a smooth manifold. Some readers may wish to
pursue this elegant point of view, which we have not attempted to touch on here
as it will not be needed.




3 Quantization and the Schrödinger Equation

The basic equation of quantum mechanics is the Schrödinger equation which expresses
the wave function $\psi$ of a quantum system as an eigenfunction of a quantized
Hamiltonian operator $H$: $H\psi = \lambda\psi$ where the (real) eigenvalue $\lambda$ is the
quantum energy of the system in the state $\psi$; see equations (1.3.2), (1.3.4). Embodied
already in this equation is the basic quantum mechanical principle that
quantum energies cannot take on arbitrary values but are quantized: they are given
by a discrete set of eigenvalues of a suitable second-order differential operator.
This mathematical phenomenon of the discreteness of eigenvalues explains, for
example, the observed discreteness of absorption and emission atomic spectral
lines; compare remarks in Sections 1.2 and 1.3 of Chapter 1. The Schrödinger
theory, and the equivalent theory of Heisenberg, Born and Jordan, represents a
distinct advancement of the Bohr theory. Some early basic papers on quantum
mechanics are compiled in the book [82], which includes a historic introduction
by B. van der Waerden. Also see [6, 8, 9, 24, 75, 76].

As indicated earlier, important physical information (the probability that a particle
assumes habitation of a particular region of space, for example) is carried
by the norm squared $|\psi|^2$ of $\psi$, rather than by $\psi$ itself. The operator $H$ and the
wave function $\psi$ therefore have life, more precisely, on a suitable Hilbert space.
We shall elaborate a bit more on the remarks briefly made in Chapter 1 concerning
quantization and Schrödinger's equation as we explore various examples.



3.1 The One-Dimensional Schrödinger Equation

For "quantum" or "wave mechanics" considerations of a particle of mass $m$ moving
in one dimension we proceed as follows, where for simplicity or for the sake
of definiteness we consider a type of idealized situation. Namely, suppose there is
assigned to the particle a "wave" function $\psi : \mathbb{R}^2 \to \mathbb{C}$ on $\mathbb{R}^2 = \{(x, t) \mid x, t \in \mathbb{R}\}$.
Assume, for example,

(i) $t \to \psi(x, t)$ is continuously differentiable on $\mathbb{R}$ for $x \in \mathbb{R}$ fixed,

(ii) $x \to \psi(x, t)$ has a continuous second derivative on $\mathbb{R}$ for $t \in \mathbb{R}$ fixed, and

(iii) $x \to \psi(x, t)$ (which is continuous on $\mathbb{R}$, for $t \in \mathbb{R}$ fixed) is in $L^2(\mathbb{R}, dx)$
where $dx$ also denotes Lebesgue measure on $\mathbb{R}$: $\int_{\mathbb{R}} |\psi(x, t)|^2\, dx < \infty$.

Thus, here, there corresponds to time $t$ a vector $\psi_t$ in a Hilbert space, in contrast
to the classical mechanics situation where a real number $\phi(t)$ corresponds to $t$,
$x = \phi(t)$ being the equation of motion: $\psi_t \in L^2(\mathbb{R}, dx)$.

Let $V$ be a potential energy function for the particle, say $V$ for example is a
continuous function on $\mathbb{R}$. $\psi$ is required to satisfy Schrödinger's equation:

\[
i\hbar\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2\psi}{\partial x^2} + V\psi \tag{3.1.1}
\]

where $\hbar = \frac{h}{2\pi}$ for $h$ = Planck's constant. As shown in Section 1.3 of Chapter 1, one
obtains the right-hand side of this equation by a heuristic quantization procedure.
Let $H$ denote the quantized Hamiltonian operator

\[
H = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + \text{multiplication by } V. \tag{3.1.2}
\]

Schrödinger's equation may be written, alternatively, as

\[
i\hbar\frac{\partial\psi}{\partial t} = H\psi \qquad \text{or} \qquad \frac{\hbar}{i}\frac{\partial\psi}{\partial t} + H\psi = 0 \tag{3.1.3}
\]

as noted in (1.3.9). We also noted in Chapter 1 that a solution $\psi(x, t)$ of (3.1.3)
is obtained immediately if we know a solution $\psi(x)$ of the time-independent
Schrödinger equation

\[
\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + (E - V(x))\psi(x) = 0, \qquad \text{or } H\psi = E\psi. \tag{3.1.4}
\]

Namely, one simply sets

\[
\psi(x, t) = \psi(x)\exp(Et/i\hbar). \tag{3.1.5}
\]

For $\psi$ continuous on $\mathbb{R}$ with $\psi \in L^2(\mathbb{R})$, the latter equation defines a function
$\psi : \mathbb{R}^2 \to \mathbb{C}$ which does satisfy conditions (i), (ii), (iii) above.




3.2 The Quantized Harmonic Oscillator

Take the harmonic oscillator for example with equation of motion

\[
x = \phi(t) = C\cos(\omega t - \delta), \tag{3.2.1}
\]

where $C$ is the amplitude of the oscillation, $\nu = \omega/2\pi$ is its frequency and $\delta$ is
some phase constant; see (2.2.17). The potential energy $V$ was shown to be given
in terms of the constant angular speed $\omega$:

\[
V(x) = \frac{\omega^2 m x^2}{2}. \tag{3.2.2}
\]

The time independent Schrödinger equation for the simple harmonic oscillator
(equation (3.1.4)) therefore is

\[
\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \left(E - \frac{\omega^2 m x^2}{2}\right)\psi = 0. \tag{3.2.3}
\]

This is a differential equation of hypergeometric type, a type which we will study
in Chapter 4. It will turn out that the only possible eigenvalues $E$ (corresponding
to a non-zero solution $\psi \in L^2(\mathbb{R})$) are $E = E_n \overset{\text{def}}{=} 2\pi(n + 1/2)\nu\hbar$, $n = 0, 1, 2, 3, \ldots$.
The energy levels of the quantum oscillator therefore cannot be arbitrary
but are given by the discrete set of real numbers $\{E_n\}_{n=0}^{\infty}$. This compares
with initial remarks made above.
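For the two lowest levels the eigenvalue statement can be checked by hand or numerically. The sketch below uses the standard Gaussian ground state $\psi_0(x) = e^{-m\omega x^2/2\hbar}$ and first excited state $\psi_1(x) = x\,\psi_0(x)$ (unnormalized); these explicit functions are well known but are not taken from this section, and the units $m = \hbar = 1$ and the value of $\omega$ are illustrative assumptions.

```python
import math


def energy_level(n, omega, hbar=1.0):
    """E_n = 2*pi*(n + 1/2)*nu*hbar with nu = omega / (2*pi)."""
    nu = omega / (2.0 * math.pi)
    return 2.0 * math.pi * (n + 0.5) * nu * hbar


def psi0(x, m, omega, hbar=1.0):
    """Unnormalized ground state exp(-m*omega*x^2 / (2*hbar))."""
    return math.exp(-m * omega * x * x / (2.0 * hbar))


def psi1(x, m, omega, hbar=1.0):
    """Unnormalized first excited state x * psi0(x)."""
    return x * psi0(x, m, omega, hbar)


def schrodinger_residual(psi, energy, x, m, omega, hbar=1.0, h=1e-3):
    """Left-hand side of (3.2.3), with psi'' from a central difference."""
    second = (psi(x + h, m, omega, hbar) - 2.0 * psi(x, m, omega, hbar)
              + psi(x - h, m, omega, hbar)) / (h * h)
    return (hbar ** 2 / (2.0 * m) * second
            + (energy - 0.5 * omega ** 2 * m * x * x) * psi(x, m, omega, hbar))
```

With `energy_level(0, omega)` for `psi0` and `energy_level(1, omega)` for `psi1`, the residual should be close to zero at every sample point, while other energies leave a visibly non-zero residual.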

Although the potential energy $V : x \to \omega^2 m x^2/2$ is continuously differentiable
in this example (it is of course even a $C^\infty$ function), in many practical situations
$V$ will have a finite number of discontinuities, or even an infinite number of such.
$V$ in the next example will be discontinuous at two points.



3.3 A Finite Potential Well 

As an example of a discontinuous potential, fix $a, V_0 > 0$ and define

\[
V(x) =
\begin{cases}
-V_0 & \text{for } -a \le x \le a \\
0 & \text{for } |x| > a.
\end{cases} \tag{3.3.1}
\]

$V$ is continuous except at $x = \pm a$.
The graph of $V$ (Figure 3.1) defines a "well" of length $2a$ of finite depth $V_0$.
We consider the bound states of a particle in this well, i.e., $E < 0$. Roughly we
have a one-dimensional model of a neutron in a nucleus or, for example, a model
of the potential experienced by an electron in a molecule like acetylene C$_2$H$_2$:
H − C ≡ C − H with a triple bond between the two carbon atoms.

Though we take $E < 0$ we assume $E > -V_0$. On the open set $x > a$, $V = 0$
and Schrödinger's equation (3.1.4) becomes

\[
\psi''(x) = \alpha^2\psi(x), \qquad \alpha = +\frac{\sqrt{2m(-E)}}{\hbar}. \tag{3.3.2}
\]

Figure 3.1: Finite potential well



The general solution of equation (3.3.2) is $\psi(x) = A\exp(\alpha x) + B\exp(-\alpha x)$ on
$x > a$. Since $\exp(\alpha x) \to \infty$ as $x \to \infty$ (for $\alpha > 0$) we choose $A = 0$; that is,
our interest is in square-integrable solutions. Similarly on $x < -a$, $V = 0$ and the
general solution of the Schrödinger equation is $\psi(x) = C\exp(\alpha x) + D\exp(-\alpha x)$.
Since $\exp(-\alpha x) \to \infty$ as $x \to -\infty$ we choose $D = 0$:

\[
\psi(x) =
\begin{cases}
Be^{-\alpha x} & \text{for } x > a \\
Ce^{\alpha x} & \text{for } x < -a.
\end{cases} \tag{3.3.3}
\]



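Let $\beta = +\frac{\sqrt{2m(E + V_0)}}{\hbar}$; we have assumed $E + V_0 > 0$. On $[-a, a]$, $V = -V_0$,
and Schrödinger's equation $\psi''(x) = -\beta^2\psi(x)$ has general solution

\[
\psi(x) = E\cos(\beta x) + F\sin(\beta x) \quad \text{on } [-a, a], \tag{3.3.4}
\]

where the coefficients $E$, $F$ are constants; in the coefficient equations that follow, $E$ denotes this constant and not the energy, which enters only through $\alpha$ and $\beta$.
To make further progress we impose the boundary conditions of continuity of $\psi$,
$\psi'$ at the discontinuities $x = \pm a$ of $V$:

\[
\psi'(x) =
\begin{cases}
-\alpha Be^{-\alpha x} & \text{on } x > a, \\
\alpha Ce^{\alpha x} & \text{on } x < -a, \\
-\beta E\sin(\beta x) + \beta F\cos(\beta x) & \text{on } (-a, a),
\end{cases} \tag{3.3.5}
\]

\[
\begin{aligned}
Be^{-\alpha a} &= E\cos(\beta a) + F\sin(\beta a), \\
Ce^{-\alpha a} &= E\cos(\beta a) - F\sin(\beta a), \\
-\alpha Be^{-\alpha a} &= -\beta E\sin(\beta a) + \beta F\cos(\beta a), \\
\alpha Ce^{-\alpha a} &= \beta E\sin(\beta a) + \beta F\cos(\beta a).
\end{aligned} \tag{3.3.6}
\]

These equations give

\[
(B + C)e^{-\alpha a} = 2E\cos(\beta a), \tag{3.3.7}
\]

\[
\alpha(C - B)e^{-\alpha a} = 2\beta F\cos(\beta a), \tag{3.3.8}
\]

\[
(B - C)e^{-\alpha a} = 2F\sin(\beta a), \tag{3.3.9}
\]

\[
\alpha(B + C)e^{-\alpha a} = 2\beta E\sin(\beta a). \tag{3.3.10}
\]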


Note that if both $B + C = 0$ and $B - C = 0$ then of course $B$ and $C = 0$. But
also by equations (3.3.7) and (3.3.10), $2E\cos(\beta a) = 0$ and $2\beta E\sin(\beta a) = 0 \implies$
$2E\sin(\beta a) = 0 \implies 4E^2 = 0 \implies$ the coefficient $E = 0$. Similarly $B - C = 0 \implies$
$F = 0$. That is, both $B + C, B - C = 0 \implies B, C, E, F = 0 \implies \psi = 0$.
Thus assume $\psi \ne 0$. Then either $B + C \ne 0$ or $B - C \ne 0$. Note also that
then both $\cos(\beta a) \ne 0$ and $\sin(\beta a) \ne 0$. For if $\cos(\beta a) = 0$, then $B + C = 0$ and
$B - C = 0$ by equations (3.3.7) and (3.3.8). Similarly if $\sin(\beta a) = 0$, $B + C = 0$
and $B - C = 0$ by equations (3.3.9) and (3.3.10). Now if $B + C \ne 0$ then the coefficient
$E \ne 0$ by equation (3.3.7) and we have (dividing the fourth equation by the first)
$\alpha = \beta\tan(\beta a)$. Similarly if $B - C \ne 0$, then $F \ne 0$ and $-\alpha = \beta\cot(\beta a)$. Since the energy $E$
is determined by $\alpha$, $\beta$ we have proved the following quantization condition.

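Proposition 3.1. For bound states $\psi \ne 0$ given by (3.3.3), (3.3.4), (3.3.5) with
$-V_0 < E < 0$ we have for

\[
\alpha \overset{\text{def}}{=} \frac{\sqrt{2m(-E)}}{\hbar}, \qquad \beta \overset{\text{def}}{=} \frac{\sqrt{2m(E + V_0)}}{\hbar} \in \mathbb{R}, \tag{3.3.11}
\]

that $\sin(\beta a) \ne 0$ and $\cos(\beta a) \ne 0$, and the following condition on the quantum
energies $E$. Either $B + C \ne 0$ or $B - C \ne 0$. If $B + C \ne 0$, then $E$ satisfies
$\alpha = \beta\tan(\beta a)$. If $B - C \ne 0$, then $E$ satisfies $-\alpha = \beta\cot(\beta a)$.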

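Note that in fact we cannot have both cases $B + C \ne 0$ and $B - C \ne 0$ at
the same time. For this would give $\alpha = \beta\tan(\beta a)$ and $\alpha = -\beta\cot(\beta a)$. Hence
$\alpha^2 = -\beta^2 \implies$ both $\alpha, \beta = 0$ which is a contradiction. Therefore if $B + C \ne 0$,
then $B - C = 0$, and hence also $F = 0$ by (3.3.8) or (3.3.9) since $\sin(\beta a) \ne 0$ and
$\cos(\beta a) \ne 0$. By equations (3.3.3) and (3.3.4) therefore

\[
\psi(x) =
\begin{cases}
Be^{-\alpha x} & \text{for } x > a \\
Be^{\alpha x} & \text{for } x < -a \\
E\cos(\beta x) & \text{for } -a \le x \le a
\end{cases}
\qquad \text{for } B + C \ne 0, \tag{3.3.12}
\]

where $E$ in the last line is the coefficient from (3.3.4), and hence $\psi$ is even: $\psi(x) = \psi(-x)$ $\forall x \in \mathbb{R}$.

Similarly if $B - C \ne 0$, then $B + C = 0$ and the coefficient $E = 0 \implies$ (by equations (3.3.3),
(3.3.4))

\[
\psi(x) =
\begin{cases}
Be^{-\alpha x} & \text{for } x > a \\
-Be^{\alpha x} & \text{for } x < -a \\
F\sin(\beta x) & \text{for } -a \le x \le a;
\end{cases} \tag{3.3.13}
\]

hence $\psi(-x) = -\psi(x)$ $\forall x \in \mathbb{R}$ if $B - C \ne 0$. We see that the non-zero wave
functions $\psi$ are either even or odd functions, under the condition $-V_0 < E < 0$.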

Since $(\alpha a)^2 + (\beta a)^2 = 2mV_0a^2/\hbar^2$ represents a circle about the origin with
radius $\gamma \overset{\text{def}}{=} \sqrt{2mV_0}\,a/\hbar$, we can obtain the energy levels by the intersections of
the circle with the curves $y = x\tan(x)$, $y = -x\cot(x)$, illustrated, in part, in
Figures 3.2 and 3.3.

Figure 3.2: Tangent intersections

For a radius $\gamma$ with $\gamma \le \frac{\pi}{2}$ we see that there are no intersections in the odd case.
That is, odd bound states cannot exist unless $\gamma^2 > \frac{\pi^2}{4}$, i.e., unless $V_0a^2 > \frac{\pi^2\hbar^2}{8m}$.
However we see that there is always a smallest bound state (called the ground
energy state) which is even. For $V_0$, $m$, $a$ fixed (i.e., for a fixed radius $\gamma$) there
are only finitely many bound states (as there are only finitely many intersections).
Suppose the radius $\gamma$ satisfies $\frac{n\pi}{2} < \gamma \le \frac{(n+1)\pi}{2}$ for $n$ a positive integer. Then
one sees that in the even case there are $\frac{n+1}{2}$ intersections for $n + 1$ even and $\frac{n+2}{2}$
intersections for $n + 1$ odd. Similarly in the odd case there are $\frac{n+1}{2}$ intersections
for $n + 1$ even and $\frac{n}{2}$ intersections for $n + 1$ odd. Since $\gamma = \sqrt{2mV_0}\,a/\hbar$ we have
checked the following.

Proposition 3.2. For any $V_0$ there are only finitely many bound states. Even bound
states always exist, but odd bound states exist only if $V_0a^2 > \frac{\pi^2\hbar^2}{8m}$.

For $n = 1, 2, 3, \ldots$ fixed suppose that

$$\frac{n\pi}{2} < \frac{\sqrt{2mV_0}\,a}{\hbar} < \frac{(n+1)\pi}{2}.$$

Then the number of even bound states is $\frac{n+1}{2}$ for $n + 1$ even, and $\frac{n+2}{2}$ for $n + 1$ odd.
The number of odd bound states is $\frac{n+1}{2}$ for $n + 1$ even, and $\frac{n}{2}$ for $n + 1$ odd. The
number of bound states altogether (even and odd) is therefore $n + 1$.
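The counting in Proposition 3.2 can be checked numerically. The following is only a sketch, not part of the original argument: it assumes the intersection points are taken in the first quadrant, and it uses the observation that on the circle of radius $\gamma$ the condition $y = x\tan x > 0$ reduces to $x = \gamma|\cos x|$ on $(k\pi, k\pi + \pi/2)$, while $y = -x\cot x > 0$ reduces to $x = \gamma|\sin x|$ on $(k\pi + \pi/2, (k+1)\pi)$. The function names are illustrative.

```python
import math

def bisect(f, lo, hi, tol=1e-12):
    """Root of f on [lo, hi], assuming f(lo) < 0 < f(hi)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def well_intersections(gamma):
    """Return (even, odd): x-coordinates where the circle of radius gamma
    meets y = x tan x (even states) and y = -x cot x (odd states), y > 0."""
    even, odd = [], []
    k = 0
    # Even case: solve x = gamma*|cos x| on (k*pi, k*pi + pi/2).
    while k * math.pi < gamma:
        lo = k * math.pi
        even.append(bisect(lambda x: x - gamma * abs(math.cos(x)), lo, lo + math.pi / 2))
        k += 1
    k = 0
    # Odd case: solve x = gamma*|sin x| on (k*pi + pi/2, (k+1)*pi).
    while (k + 0.5) * math.pi < gamma:
        lo = (k + 0.5) * math.pi
        odd.append(bisect(lambda x: x - gamma * abs(math.sin(x)), lo, lo + math.pi / 2))
        k += 1
    return even, odd

for gamma in (1.0, 4.0, 7.0):
    even, odd = well_intersections(gamma)
    n = int(gamma / (math.pi / 2))   # n*pi/2 < gamma < (n+1)*pi/2
    print(gamma, n, len(even), len(odd), len(even) + len(odd))
```

For each sample radius the last printed column can be compared with $n + 1$; for $\gamma < \pi/2$ only the even ground state is found.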

Again, in the above analysis the assumption $-V_0 < E < 0$, $V_0 > 0$ is imposed.
Since the radius $\gamma$ increases with $V_0$ (and with $m$ and $a$) the intersections (and
hence the number of energy levels, or bound states) increase in number as $V_0$ is
increased, or as $m$ or $a$ is increased.

3.4 Schrödinger's Equation for a System

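Consider next a system of $n$ particles moving in three dimensions, with $m_j$ = mass
of the $j$th particle, the motion being influenced by forces derived from a potential
energy function $V : \mathbb{R}^{3n} \to \mathbb{R}$ with continuous first partial derivatives.

If $(x_j(t), y_j(t), z_j(t))$ is the position of the $j$th particle at time $t$, where the
derivatives $\dot{x}_j, \dot{y}_j, \dot{z}_j, \ddot{x}_j, \ddot{y}_j, \ddot{z}_j$ exist and are continuous, then the conservation of
total energy $H = T + V$ is expressed as follows:

$$\frac{1}{2}\sum_{j=1}^{n} m_j\left(\dot{x}_j(t)^2 + \dot{y}_j(t)^2 + \dot{z}_j(t)^2\right) + V(x_1(t), y_1(t), z_1(t), \ldots, x_n(t), y_n(t), z_n(t)) \ \text{is constant} \tag{3.4.1}$$

for all $t$, as we have seen in (2.4.8).

For $p^{(j)} \overset{\text{def}}{=} m_jv^{(j)} = m_j(\dot{x}_j, \dot{y}_j, \dot{z}_j)$ the momentum of the $j$th particle ($= (p_1^{(j)}, p_2^{(j)}, p_3^{(j)})$), the first term in (3.4.1) (the kinetic energy term) is

$$\frac{1}{2}\sum_{j=1}^{n} m_j\left|v^{(j)}\right|^2 = \sum_{j=1}^{n} \frac{1}{2m_j}\left[\left(p_1^{(j)}\right)^2 + \left(p_2^{(j)}\right)^2 + \left(p_3^{(j)}\right)^2\right]. \tag{3.4.2}$$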

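Turning to quantum mechanical considerations we suppose we have a “wave”
function $\psi : \mathbb{R}^{3n} \times \mathbb{R} \to \mathbb{C}$ for our system. For example, in an ideal situation we
might have

(i) $t \mapsto \psi(x, t)$ is continuously differentiable on $\mathbb{R}$ for $x \in \mathbb{R}^{3n}$ fixed,

(ii) $x \mapsto \psi(x, t) : \mathbb{R}^{3n} \to \mathbb{C}$ has continuous first and second partial derivatives,
for $t \in \mathbb{R}$ fixed, and

(iii) $x \mapsto \psi(x, t)$ (which is continuous on $\mathbb{R}^{3n}$, for $t \in \mathbb{R}$ fixed) is in $L^2(\mathbb{R}^{3n}, dx)$
where $dx$ = Lebesgue measure on $\mathbb{R}^{3n}$:

$$\int_{\mathbb{R}^{3n}} |\psi(x, t)|^2\, dx < \infty. \tag{3.4.3}$$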



(3.4.4) 



where $(x_1, y_1, z_1, x_2, y_2, z_2, \ldots, x_n, y_n, z_n)$ are the coordinates of a point in $\mathbb{R}^{3n}$.
(3.4.4) is an extension of (1.3.10).




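Let $\nabla_j^2 \overset{\text{def}}{=} \frac{\partial^2}{\partial x_j^2} + \frac{\partial^2}{\partial y_j^2} + \frac{\partial^2}{\partial z_j^2}$. Then Schrödinger's equation can be written as

$$-\frac{h}{2\pi i}\frac{\partial \psi}{\partial t} = -\frac{h^2}{8\pi^2}\sum_{j=1}^{n}\frac{1}{m_j}\nabla_j^2\psi + V\psi. \tag{3.4.5}$$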



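One can obtain the right-hand side of the Schrödinger equation by the following
heuristic procedure (again as in Chapter 1). To the “classical observables”
$p_1^{(j)}, p_2^{(j)}, p_3^{(j)}$ (the momentum coordinates) assign the “operators” $\frac{\hbar}{i}\frac{\partial}{\partial x_j}$, $\frac{\hbar}{i}\frac{\partial}{\partial y_j}$,
$\frac{\hbar}{i}\frac{\partial}{\partial z_j}$, and to the observable $V$ assign the operator which is multiplication by
$V$. To the kinetic energy $\sum_{j=1}^{n}\frac{1}{2m_j}\left[\left(p_1^{(j)}\right)^2 + \left(p_2^{(j)}\right)^2 + \left(p_3^{(j)}\right)^2\right]$ there corresponds therefore
the operator $-\frac{\hbar^2}{2}\sum_{j=1}^{n}\frac{1}{m_j}\nabla_j^2$. To the total energy $H = T + V$ (or “Hamiltonian”)
there corresponds the quantized Hamiltonian operator
$-\frac{\hbar^2}{2}\sum_{j=1}^{n}\frac{1}{m_j}\nabla_j^2$ + multiplication by $V$, which acting on $\psi$ gives
the right-hand side of the Schrödinger equation (3.4.4) (or (3.4.5)).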

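For example, fix $E \in \mathbb{R}$ and let $\psi : \mathbb{R}^{3n} \to \mathbb{C}$ be a function with continuous first
and second partial derivatives such that $\psi \in L^2(\mathbb{R}^{3n}, dx)$. Define $\Psi : \mathbb{R}^{3n} \times \mathbb{R} \to \mathbb{C}$
by $\Psi(x, t) = \psi(x)\exp\left(-\frac{iE}{\hbar}t\right)$. Then $\Psi$ satisfies (i), (ii), (iii) above, and also satisfies

$$\frac{\hbar}{i}\frac{\partial \Psi}{\partial t} = -E\Psi.$$

Also

$$\frac{\partial^2\Psi}{\partial x_j^2} + \frac{\partial^2\Psi}{\partial y_j^2} + \frac{\partial^2\Psi}{\partial z_j^2} = \left(\frac{\partial^2\psi}{\partial x_j^2} + \frac{\partial^2\psi}{\partial y_j^2} + \frac{\partial^2\psi}{\partial z_j^2}\right)\exp\left(-\frac{iE}{\hbar}t\right). \tag{3.4.6}$$

We see therefore that $\Psi$ satisfies Schrödinger's equation if and only if

$$E\psi(x)\exp\left(-\frac{iE}{\hbar}t\right) = -\frac{\hbar^2}{2}\sum_{j=1}^{n}\frac{1}{m_j}\left(\nabla_j^2\psi\right)(x)\exp\left(-\frac{iE}{\hbar}t\right) + V(x)\psi(x)\exp\left(-\frac{iE}{\hbar}t\right), \tag{3.4.7}$$

that is, if and only if $\psi$ satisfies the time independent Schrödinger equation:

$$-\frac{\hbar^2}{2}\sum_{j=1}^{n}\frac{1}{m_j}\left(\frac{\partial^2\psi}{\partial x_j^2} + \frac{\partial^2\psi}{\partial y_j^2} + \frac{\partial^2\psi}{\partial z_j^2}\right) + V\psi = E\psi, \tag{3.4.8}$$

or

$$\frac{\hbar^2}{2}\sum_{j=1}^{n}\frac{1}{m_j}\left(\frac{\partial^2\psi}{\partial x_j^2} + \frac{\partial^2\psi}{\partial y_j^2} + \frac{\partial^2\psi}{\partial z_j^2}\right) - (V - E)\psi = 0, \tag{3.4.9}$$

or (as $\hbar = h/2\pi$)

$$\sum_{j=1}^{n}\frac{1}{m_j}\left(\frac{\partial^2\psi}{\partial x_j^2} + \frac{\partial^2\psi}{\partial y_j^2} + \frac{\partial^2\psi}{\partial z_j^2}\right) + \frac{8\pi^2}{h^2}(E - V)\psi = 0. \tag{3.4.10}$$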

In practice the preceding ideal situation may not prevail. As in previous examples 
$V$ might be defined on some subset of $\mathbb{R}^{3n}$, or might be a discontinuous function.

3.5 Schrödinger's Equation for an Atom

For a system (= an atom) consisting of $n$ electrons, each with charge $-e$, and a
nucleus of charge $Ze$ ($Z$ a positive integer $\overset{\text{def}}{=}$ number of protons = the atomic
number), there is an easy application of Coulomb's electrostatic law.

Let $x^{(j)} = (x_j, y_j, z_j) \in \mathbb{R}^3$ denote the position of the $j$th electron and let
$x^{(0)} = (x_0, y_0, z_0) \in \mathbb{R}^3$ denote the position of the nucleus. The force on the $j$th
electron due to the nucleus is, by Coulomb's law (2.5.1),

$$-Ze^2\,\frac{x^{(j)} - x^{(0)}}{\left|x^{(j)} - x^{(0)}\right|^3}. \tag{3.5.1}$$



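The other electrons (say $n \geq 2$) contribute to the total force on the $j$th electron the
amount of force

$$\sum_{i=1,\, i \neq j}^{n} e^2\,\frac{x^{(j)} - x^{(i)}}{\left|x^{(j)} - x^{(i)}\right|^3}. \tag{3.5.2}$$

The total force on the $j$th electron is therefore

$$F^{(j)} = -Ze^2\,\frac{x^{(j)} - x^{(0)}}{\left|x^{(j)} - x^{(0)}\right|^3} + \sum_{i=1,\, i \neq j}^{n} e^2\,\frac{x^{(j)} - x^{(i)}}{\left|x^{(j)} - x^{(i)}\right|^3}. \tag{3.5.3}$$

Regard $F^{(j)}$ as a function on the set

$$U \overset{\text{def}}{=} \left\{x = (x^{(0)}, x^{(1)}, \ldots, x^{(n)}) \in \mathbb{R}^3 \times \mathbb{R}^{3n} = \mathbb{R}^{3(n+1)} \;\middle|\; x^{(j)} \neq x^{(0)},\ 1 \leq j \leq n,\ x^{(j)} \neq x^{(i)},\ j \neq i,\ 1 \leq i, j \leq n\right\}. \tag{3.5.4}$$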



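Write $F^{(j)} = (X_j, Y_j, Z_j)$, as in Section 2.4 of Chapter 2: $x^{(j)} - x^{(0)} = (x_j - x_0, y_j - y_0, z_j - z_0)$, $x^{(j)} - x^{(i)} = (x_j - x_i, y_j - y_i, z_j - z_i) \implies$

$$\begin{aligned}
X_j(x) &= \frac{-Ze^2(x_j - x_0)}{\left|x^{(j)} - x^{(0)}\right|^3} + \sum_{i \neq j}\frac{e^2(x_j - x_i)}{\left|x^{(j)} - x^{(i)}\right|^3},\\
Y_j(x) &= \frac{-Ze^2(y_j - y_0)}{\left|x^{(j)} - x^{(0)}\right|^3} + \sum_{i \neq j}\frac{e^2(y_j - y_i)}{\left|x^{(j)} - x^{(i)}\right|^3},\\
Z_j(x) &= \frac{-Ze^2(z_j - z_0)}{\left|x^{(j)} - x^{(0)}\right|^3} + \sum_{i \neq j}\frac{e^2(z_j - z_i)}{\left|x^{(j)} - x^{(i)}\right|^3},
\end{aligned} \tag{3.5.5}$$

for $x = (x^{(0)}, x^{(1)}, \ldots, x^{(n)}) \in U$.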

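We show that the $F^{(j)}$ are derived from a potential energy function $V$ on $U$.
Define $V : U \to \mathbb{R}$ by

$$V(x) = V(x^{(0)}, x^{(1)}, \ldots, x^{(n)}) = \sum_{j=1}^{n}\frac{-Ze^2}{\left|x^{(j)} - x^{(0)}\right|} + \sum_{1 \leq i < j \leq n}\frac{e^2}{\left|x^{(i)} - x^{(j)}\right|}. \tag{3.5.6}$$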




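Now

$$\frac{\partial}{\partial x_j}\frac{1}{\left|x^{(i)} - x^{(0)}\right|} = \frac{\partial}{\partial x_j}\left[(x_i - x_0)^2 + (y_i - y_0)^2 + (z_i - z_0)^2\right]^{-1/2} = -\frac{1}{2}\left[(x_i - x_0)^2 + (y_i - y_0)^2 + (z_i - z_0)^2\right]^{-3/2} 2(x_i - x_0)\delta_{ij} \tag{3.5.7}$$

and similarly

$$\frac{\partial}{\partial x_j}\frac{1}{\left|x^{(i)} - x^{(l)}\right|} = -\frac{1}{2}\left[(x_i - x_l)^2 + (y_i - y_l)^2 + (z_i - z_l)^2\right]^{-3/2} 2(x_i - x_l)(\delta_{ij} - \delta_{lj}) \tag{3.5.8}$$

$\implies$ for $1 \leq j \leq n$

$$\frac{\partial V}{\partial x_j}(x) = \frac{Ze^2(x_j - x_0)}{\left|x^{(j)} - x^{(0)}\right|^3} + \sum_{1 \leq i < l \leq n}\frac{-e^2(x_i - x_l)}{\left|x^{(i)} - x^{(l)}\right|^3}(\delta_{ij} - \delta_{lj}). \tag{3.5.9}$$

The second term here is

$$\begin{aligned}
&-e^2\sum_{i<l}\frac{(x_i - x_l)\delta_{ij}}{\left|x^{(i)} - x^{(l)}\right|^3} + e^2\sum_{i<l}\frac{(x_i - x_l)\delta_{lj}}{\left|x^{(i)} - x^{(l)}\right|^3}\\
&\quad = -e^2\sum_{l>j}\frac{(x_j - x_l)}{\left|x^{(j)} - x^{(l)}\right|^3} + e^2\sum_{i<j}\frac{(x_i - x_j)}{\left|x^{(i)} - x^{(j)}\right|^3}\\
&\quad = -e^2\sum_{i>j}\frac{(x_j - x_i)}{\left|x^{(j)} - x^{(i)}\right|^3} - e^2\sum_{i<j}\frac{(x_j - x_i)}{\left|x^{(j)} - x^{(i)}\right|^3}
= \sum_{i \neq j}\frac{-e^2(x_j - x_i)}{\left|x^{(j)} - x^{(i)}\right|^3}
\end{aligned}$$

$$\implies -\frac{\partial V}{\partial x_j}(x) = X_j(x) \quad \text{for } 1 \leq j \leq n. \tag{3.5.10}$$

Similarly

$$-\frac{\partial V}{\partial y_j}(x) = Y_j(x) \quad \text{and} \quad -\frac{\partial V}{\partial z_j}(x) = Z_j(x) \quad \text{for } 1 \leq j \leq n. \tag{3.5.11}$$

Also

$$\frac{\partial}{\partial x_0}\frac{1}{\left|x^{(i)} - x^{(0)}\right|} = \frac{\partial}{\partial x_0}\left[(x_i - x_0)^2 + (y_i - y_0)^2 + (z_i - z_0)^2\right]^{-1/2} = -\frac{1}{2}\left[(x_i - x_0)^2 + (y_i - y_0)^2 + (z_i - z_0)^2\right]^{-3/2} 2(x_i - x_0)(-1) = \frac{(x_i - x_0)}{\left|x^{(i)} - x^{(0)}\right|^3} \tag{3.5.12}$$

and

$$\frac{\partial}{\partial x_0}\frac{1}{\left|x^{(i)} - x^{(j)}\right|} = 0 \tag{3.5.13}$$

$$\implies \frac{\partial V}{\partial x_0} = \sum_{i=1}^{n}\frac{-Ze^2(x_i - x_0)}{\left|x^{(i)} - x^{(0)}\right|^3} = \sum_{i=1}^{n}\frac{Ze^2(x_0 - x_i)}{\left|x^{(i)} - x^{(0)}\right|^3}. \tag{3.5.14}$$

Similarly

$$\frac{\partial V}{\partial y_0} = \sum_{i=1}^{n}\frac{Ze^2(y_0 - y_i)}{\left|x^{(i)} - x^{(0)}\right|^3}, \qquad \frac{\partial V}{\partial z_0} = \sum_{i=1}^{n}\frac{Ze^2(z_0 - z_i)}{\left|x^{(i)} - x^{(0)}\right|^3}. \tag{3.5.15}$$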



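By Coulomb's law the force $F^{(0)}$ on the nucleus (of charge $Ze$) due to the $n$
electrons is given by

$$F^{(0)} = \sum_{j=1}^{n} -Ze^2\,\frac{x^{(0)} - x^{(j)}}{\left|x^{(0)} - x^{(j)}\right|^3}. \tag{3.5.16}$$

Thus $F^{(0)} = (X_0, Y_0, Z_0)$ where

$$\begin{aligned}
X_0(x) &= \sum_{j=1}^{n}\frac{-Ze^2(x_0 - x_j)}{\left|x^{(0)} - x^{(j)}\right|^3} = -\frac{\partial V}{\partial x_0},\\
Y_0(x) &= \sum_{j=1}^{n}\frac{-Ze^2(y_0 - y_j)}{\left|x^{(0)} - x^{(j)}\right|^3} = -\frac{\partial V}{\partial y_0}, \quad \text{and}\\
Z_0(x) &= \sum_{j=1}^{n}\frac{-Ze^2(z_0 - z_j)}{\left|x^{(0)} - x^{(j)}\right|^3} = -\frac{\partial V}{\partial z_0}.
\end{aligned} \tag{3.5.17}$$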



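We see that indeed the $F^{(j)}$ are derived from a potential $V$. Given the potential
energy $V : U \subset \mathbb{R}^{3(n+1)} \to \mathbb{R}$ of an atom,

$$V(x) = \sum_{j=1}^{n}\frac{-Ze^2}{\left|x^{(j)} - x^{(0)}\right|} + \sum_{1 \leq i < j \leq n}\frac{e^2}{\left|x^{(i)} - x^{(j)}\right|} \tag{3.5.18}$$

for $x^{(j)} = (x_j, y_j, z_j)$, Schrödinger's time independent equation (3.4.10) for such
a system is therefore

$$0 = \frac{1}{m_0}\left(\nabla_0^2\psi\right)(x) + \sum_{j=1}^{n}\frac{1}{m}\left(\nabla_j^2\psi\right)(x) + \frac{8\pi^2}{h^2}(E - V(x))\psi(x), \tag{3.5.19}$$

where $m_0$ = mass of the nucleus and $m$ = electron mass. The time-dependent
equation (3.4.4) is

$$i\hbar\,\frac{\partial\psi}{\partial t}(x, t) = -\frac{\hbar^2}{2}\sum_{j=0}^{n}\frac{1}{m_j}\left(\nabla_j^2\psi\right)(x, t) + V(x)\psi(x, t) \tag{3.5.20}$$

where $m_j = m$ for $1 \leq j \leq n$.

The helium atom, for example, consists of two electrons, two protons ($Z = 2$),
and two neutrons. The symbol for helium is He. The Schrödinger time independent
equation for He is therefore

$$\frac{1}{m_0}\left(\nabla_0^2\psi\right)(x) + \frac{1}{m}\left(\nabla_1^2\psi\right)(x) + \frac{1}{m}\left(\nabla_2^2\psi\right)(x) + \frac{8\pi^2}{h^2}(E - V(x))\psi(x) = 0 \tag{3.5.21}$$

for $x = (x^{(0)}, x^{(1)}, x^{(2)}) = (x_0, y_0, z_0, x_1, y_1, z_1, x_2, y_2, z_2)$ where

$$V(x) = \frac{-2e^2}{\sqrt{(x_1 - x_0)^2 + (y_1 - y_0)^2 + (z_1 - z_0)^2}} + \frac{-2e^2}{\sqrt{(x_2 - x_0)^2 + (y_2 - y_0)^2 + (z_2 - z_0)^2}} + \frac{e^2}{\sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}}. \tag{3.5.22}$$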



Even for a simple atom like helium, however, the Schrodinger equation has 
never been solved exactly. We shall therefore study the hydrogen atom (with 
only one electron and one proton, and for which the Schrodinger equation can 
be solved exactly) in detail, in Chapter 5. 
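The statement that the Coulomb forces (3.5.3), (3.5.16) are derived from the potential (3.5.18) can be spot-checked numerically for helium. The sketch below is illustrative only: it assumes units with $e^2 = 1$, an arbitrary sample configuration, and a central-difference gradient; the function names are hypothetical and not from the text.

```python
import math

Z = 2       # helium: nuclear charge Z*e
E2 = 1.0    # e^2 in illustrative units (assumption)

def potential(pts):
    """V of (3.5.18); pts[0] is the nucleus, pts[1:] are the electrons."""
    nucleus, electrons = pts[0], pts[1:]
    v = sum(-Z * E2 / math.dist(p, nucleus) for p in electrons)
    for i in range(len(electrons)):
        for j in range(i + 1, len(electrons)):
            v += E2 / math.dist(electrons[i], electrons[j])
    return v

def coulomb_force(pts, j):
    """Force on particle j: (3.5.3) for an electron, (3.5.16) for the nucleus (j = 0)."""
    f = [0.0, 0.0, 0.0]
    for i, q in enumerate(pts):
        if i == j:
            continue
        # nucleus-electron pairs attract (-Z e^2), electron-electron pairs repel (+e^2)
        coeff = -Z * E2 if (i == 0 or j == 0) else E2
        r = math.dist(pts[j], q)
        for k in range(3):
            f[k] += coeff * (pts[j][k] - q[k]) / r**3
    return f

def minus_gradient(pts, j, h=1e-6):
    """Central-difference approximation of -(dV/dx_j, dV/dy_j, dV/dz_j)."""
    g = []
    for k in range(3):
        plus = [list(p) for p in pts]
        minus = [list(p) for p in pts]
        plus[j][k] += h
        minus[j][k] -= h
        g.append(-(potential(plus) - potential(minus)) / (2 * h))
    return g

points = [(0.0, 0.0, 0.0), (1.0, 0.2, -0.3), (-0.7, 0.9, 0.4)]
for j in range(3):
    print(j, coulomb_force(points, j), minus_gradient(points, j))
```

The two printed vectors for each particle should agree up to the finite-difference error, in line with (3.5.10), (3.5.11) and (3.5.17).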



3.6 Non-Quantization of the Free Particle 



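For a particle of mass $m$ moving in space under a zero force field ($V : \mathbb{R}^3 \to \mathbb{R}$
is zero) the time independent Schrödinger equation (3.4.10) takes the (simplest)
form

$$\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2} + \frac{8\pi^2}{h^2}mE\psi = 0. \tag{3.6.1}$$

We look for a solution $\psi \neq 0$ with $\psi(x, y, z) = \psi_1(x)\psi_2(y)\psi_3(z)$ where $\psi_j : \mathbb{R} \to \mathbb{C}$
have continuous first and second derivatives. $\psi = \psi_1 \otimes \psi_2 \otimes \psi_3 \implies \frac{\partial^2\psi}{\partial x^2} = \psi_1'' \otimes \psi_2 \otimes \psi_3$, with similar statements for $\frac{\partial^2\psi}{\partial y^2}$, $\frac{\partial^2\psi}{\partial z^2}$. Equation (3.6.1) becomes

$$\psi_1''(x)\psi_2(y)\psi_3(z) + \psi_1(x)\psi_2''(y)\psi_3(z) + \psi_1(x)\psi_2(y)\psi_3''(z) + \frac{8\pi^2}{h^2}mE\,\psi_1(x)\psi_2(y)\psi_3(z) = 0 \tag{3.6.2}$$

for $(x, y, z) \in \mathbb{R}^3$. Choose and fix $(y, z) \in \mathbb{R}^2$ such that $\psi_2(y), \psi_3(z) \neq 0$. Divide
by $\psi_2(y)\psi_3(z)$:

$$\psi_1''(x) + \psi_1(x)\frac{\psi_2''(y)}{\psi_2(y)} + \psi_1(x)\frac{\psi_3''(z)}{\psi_3(z)} + C\psi_1(x) = 0 \tag{3.6.3}$$

for $C \overset{\text{def}}{=} \frac{8\pi^2mE}{h^2}$. That is $\psi_1''(x) = \kappa_1\psi_1(x)$ on $\mathbb{R}$ for $\kappa_1 \overset{\text{def}}{=} -\frac{\psi_2''(y)}{\psi_2(y)} - \frac{\psi_3''(z)}{\psi_3(z)} - C$ = a
constant (independent of $x$).

Similarly $\psi_2''(y) = \kappa_2\psi_2(y)$, $\psi_3''(z) = \kappa_3\psi_3(z)$ on $\mathbb{R}$ for suitable constants
$\kappa_2, \kappa_3$: $\kappa_2 = -\frac{\psi_1''(x)}{\psi_1(x)} - \frac{\psi_3''(z)}{\psi_3(z)} - C$ for some $(x, z) \in \mathbb{R}^2$ such that $\psi_1(x), \psi_3(z) \neq 0$,
$\kappa_3 = -\frac{\psi_1''(x)}{\psi_1(x)} - \frac{\psi_2''(y)}{\psi_2(y)} - C$ for some $(x, y) \in \mathbb{R}^2$ such that $\psi_1(x), \psi_2(y) \neq 0$.

Note that

$$\begin{aligned}
\psi_1(x)\psi_2(y)\psi_3(z)(\kappa_1 + \kappa_2 + \kappa_3)
&= -\psi_2''(y)\psi_1(x)\psi_3(z) - \psi_3''(z)\psi_1(x)\psi_2(y) - \psi_1''(x)\psi_2(y)\psi_3(z)\\
&\quad - \psi_3''(z)\psi_1(x)\psi_2(y) - \psi_1''(x)\psi_2(y)\psi_3(z) - \psi_2''(y)\psi_1(x)\psi_3(z)\\
&\quad - 3\psi_1(x)\psi_2(y)\psi_3(z)C\\
&= -2\psi_1''(x)\psi_2(y)\psi_3(z) - 2\psi_2''(y)\psi_1(x)\psi_3(z) - 2\psi_3''(z)\psi_1(x)\psi_2(y)\\
&\quad - 2\psi_1(x)\psi_2(y)\psi_3(z)C - \psi_1(x)\psi_2(y)\psi_3(z)C\\
&= -\psi_1(x)\psi_2(y)\psi_3(z)C \quad \text{by equation (3.6.2)}
\end{aligned} \tag{3.6.4}$$

$$\implies \kappa_1 + \kappa_2 + \kappa_3 = -C. \tag{3.6.5}$$

Define $E_1, E_2, E_3$ by $\kappa_j = -\frac{8\pi^2m}{h^2}E_j$. Then

$$\begin{aligned}
\psi_1''(x) + \frac{8\pi^2m}{h^2}E_1\psi_1(x) &= 0\\
\psi_2''(y) + \frac{8\pi^2m}{h^2}E_2\psi_2(y) &= 0 \qquad \text{on } \mathbb{R}\\
\psi_3''(z) + \frac{8\pi^2m}{h^2}E_3\psi_3(z) &= 0
\end{aligned} \tag{3.6.6}$$

are time independent Schrödinger equations. By equation (3.6.5) and the definition
of $C$, $E_1 + E_2 + E_3 = E$.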

Conversely if $\psi_j : \mathbb{R} \to \mathbb{C}$ have continuous first and second derivatives, $j = 1, 2, 3$,
and satisfy the preceding three Schrödinger equations for constants $E_1, E_2, E_3$,
then $\psi = \psi_1 \otimes \psi_2 \otimes \psi_3$ satisfies equation (3.6.1) for $E = E_1 + E_2 + E_3$: Namely,
multiply the first Schrödinger equation in (3.6.6) by $\psi_2(y)\psi_3(z)$, the second by
$\psi_1(x)\psi_3(z)$ and the third by $\psi_1(x)\psi_2(y)$, and then add:

$$\psi_1''(x)\psi_2(y)\psi_3(z) + \psi_2''(y)\psi_1(x)\psi_3(z) + \psi_3''(z)\psi_1(x)\psi_2(y) + \frac{8\pi^2m}{h^2}\psi_1(x)\psi_2(y)\psi_3(z)(E_1 + E_2 + E_3) = 0, \tag{3.6.7}$$

which is equation (3.6.1) for $E = E_1 + E_2 + E_3$.

Going back to the Schrödinger equations (3.6.6), $\psi_j'' + \frac{8\pi^2m}{h^2}E_j\psi_j = 0$, $j = 1, 2, 3$,
the general solutions are

$$\psi_j(x) = A\cos\frac{2\pi\sqrt{2mE_j}\,x}{h} + B\sin\frac{2\pi\sqrt{2mE_j}\,x}{h} = A\cos\frac{p_jx}{\hbar} + B\sin\frac{p_jx}{\hbar}, \tag{3.6.8}$$

where $p_j = \sqrt{2mE_j}$. Note: Regard $E_j$ as total energy: $E_j = \frac{1}{2}mv^2 + 0$ since
$V = 0$. Then $2mE_j = m^2v^2 \implies p_j = mv$ is the momentum. Since $E_j$ can
take on arbitrary values without restriction we see that the free particle is not
quantized. Clearly, moreover, the $\psi_j$ in (3.6.8) are not square-integrable over $\mathbb{R}$.
In contrast we have seen, for example, that in the case of the harmonic oscillator
with frequency $\nu$ one has the quantization (restriction) of energy given by $E =
E_n = (n + \frac{1}{2})\nu h$ for some integer $n \geq 0$.
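The converse direction above (a product of one-dimensional solutions solves (3.6.1) with $E = E_1 + E_2 + E_3$) can be illustrated numerically. This is a rough sketch under stated assumptions: units with $\hbar = 1$ (so $8\pi^2m/h^2 = 2m$), mass $m = 1$, one particular choice of cosine and sine factors from (3.6.8), and a second-order finite-difference Laplacian. The sample energies and evaluation point are arbitrary.

```python
import math

HBAR = 1.0   # units with h = 2*pi (assumption)
M = 1.0      # particle mass (assumption)

def psi(x, y, z, energies):
    """Product psi_1(x)*psi_2(y)*psi_3(z) of solutions of (3.6.6)."""
    k = [math.sqrt(2 * M * e) / HBAR for e in energies]   # p_j / hbar
    return math.cos(k[0] * x) * math.sin(k[1] * y) * math.cos(k[2] * z)

def residual(x, y, z, energies, d=1e-3):
    """Finite-difference value of the left-hand side of (3.6.1), E = E1 + E2 + E3."""
    f = lambda a, b, c: psi(a, b, c, energies)
    laplacian = (f(x + d, y, z) + f(x - d, y, z)
                 + f(x, y + d, z) + f(x, y - d, z)
                 + f(x, y, z + d) + f(x, y, z - d)
                 - 6 * f(x, y, z)) / d**2
    total_energy = sum(energies)
    # 8*pi^2*m/h^2 equals 2*m/hbar^2
    return laplacian + 2 * M * total_energy / HBAR**2 * f(x, y, z)

print(residual(0.3, -0.8, 1.1, (0.5, 1.7, 2.9)))
```

The printed residual is small (of the order of the discretization error) for any positive $E_1, E_2, E_3$, which reflects the absence of any quantization condition on the free particle.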

3.7 Quantization, Large Eigenvalues, and Spectral 
Zeta Functions 

One can relate the quantization of energy of some quantum systems to a general
result concerning large eigenvalues $\lambda$ of the second-order linear differential
equation on $\mathbb{R}$

$$\frac{d^2y}{dx^2} + (\lambda - q(x))y(x) = 0; \tag{3.7.1}$$

$$\text{i.e., } Dy = \lambda y \text{ for } D \overset{\text{def}}{=} -\frac{d^2}{dx^2} + q(x). \tag{3.7.2}$$

The result, roughly, is that $\lambda$ satisfies

$$\int_{a_\lambda}^{b_\lambda}[\lambda - q(x)]^{1/2}\,dx = \left(n + \frac{1}{2}\right)\pi + O\left(\frac{1}{n}\right) \tag{3.7.3}$$

for some $a_\lambda, b_\lambda \in \mathbb{R}$ and some positive integer $n$. Disregarding the term $O\left(\frac{1}{n}\right)$
for large $\lambda$ one obtains from (3.7.3) a general kind of (approximate) quantization
condition, which when specialized to Schrödinger's equation amounts to the
Bohr-Wilson-Sommerfeld quantization rule; compare (3.7.10), (3.7.13) below.
The result, Theorem 3.1 below, was developed by Kemble, Zwaan, Langer, Titchmarsh,
Olver, and others. We will state Olver's version of it, which to some extent
follows Titchmarsh. Compare [58, 67, 81], for example.

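The following assumptions are set forth. $\lambda > 0$ and $\lim_{x \to \pm\infty} y(x) = 0$,
$\lim_{x \to \pm\infty} q(x) = +\infty$. $q$ is of class $C^3$ on $\mathbb{R}$ (i.e., $q', q'', q'''$ exist and are
continuous on $\mathbb{R}$), and $q$ satisfies

(3.7.4:i) $\exists$ positive constants $\alpha, \beta$ such that $q(x) \sim \beta x^{\alpha}$ as $x \to \infty$ (i.e.,
$\lim_{x \to \infty}\frac{q(x)}{\beta x^{\alpha}} = 1$), $q'(x) \sim \beta\alpha x^{\alpha - 1}$ as $x \to \infty$, $q''(x) = O(x^{\alpha - 2})$ as $x \to \infty$ (i.e.,
$\frac{q''(x)}{x^{\alpha - 2}}$ is bounded), $q'''(x) = O(x^{\alpha - 3})$ as $x \to \infty$.

(3.7.4:ii) $\exists$ positive constants $\tilde{\alpha}, \tilde{\beta}$ such that $q(x) \sim \tilde{\beta}(-x)^{\tilde{\alpha}}$, $q'(x) \sim -\tilde{\beta}\tilde{\alpha}(-x)^{\tilde{\alpha} - 1}$,
$q''(x) = O(|x|^{\tilde{\alpha} - 2})$, $q'''(x) = O(|x|^{\tilde{\alpha} - 3})$ as $x \to -\infty$.

These assumptions on $q$ are satisfied if we take $q(x) = Ax^{2n}$ for example, where
$A > 0$, $n = 1, 2, 3, \ldots$. Namely, in (i) take $\alpha = 2n$, $\beta = A$; asymptoticity $\sim$
becomes equality. In (ii), similarly take $\tilde{\alpha} = 2n$, $\tilde{\beta} = A$.

Using assumptions (i), (ii) one can argue that $\exists\, M > 0$, sufficiently large such
that for $\lambda > M$ the equation $q(x) = \lambda$ has exactly two real roots, say $a_\lambda, b_\lambda \in \mathbb{R}$
with $a_\lambda < 0 < b_\lambda$; actually $q(x) - \lambda \leq 0$ on
$[a_\lambda, b_\lambda]$. In the example $q(x) = Ax^{2n}$ the equation $q(x) = \lambda$ (for any $\lambda > 0$) has
exactly one negative root $a_\lambda = -\left(\frac{\lambda}{A}\right)^{1/(2n)}$, and one positive root $b_\lambda = +\left(\frac{\lambda}{A}\right)^{1/(2n)}$
(by Descartes' rule of signs for example). With the above hypotheses and notation
in place, the following is proved in [67, Sections 9.2-9.4].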

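Theorem 3.1. Let $y$ be a non-zero solution of equation (3.7.1) (with $\lim_{x \to \pm\infty} y(x) = 0$). Then

$$\int_{a_\lambda}^{b_\lambda}[\lambda - q(x)]^{1/2}\,dx = \left(n + \frac{1}{2}\right)\pi + O\left(\frac{1}{n}\right) \tag{3.7.5}$$

for some positive integer $n$.

Thus for $\lambda$ sufficiently large

$$\left|\int_{a_\lambda}^{b_\lambda}[\lambda - q(x)]^{1/2}\,dx - \left(n + \frac{1}{2}\right)\pi\right| \leq \frac{C}{n} \tag{3.7.6}$$

for some constant $C > 0$ independent of $\lambda$.

For the Schrödinger equation (3.1.4)

$$\frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2}(E - V(x))\psi(x) = 0 \tag{3.7.7}$$

on $\mathbb{R}$ with $\lim_{x \to \pm\infty}\psi(x) = 0$, $\lambda = \frac{2mE}{\hbar^2}$ and $q(x) = \frac{2m}{\hbar^2}V(x)$. Then the equation
$q(x) = \lambda$ means that $V(x) = E$, say with real solutions $a_E, b_E$ at least for $E$
sufficiently large; here we take $E > V(x)$. Also $[\lambda - q(x)]^{1/2} = \frac{\sqrt{2m}}{\hbar}[E - V(x)]^{1/2}$,
so that equation (3.7.5) assumes the form

$$\frac{\sqrt{2m}}{\hbar}\int_{a_E}^{b_E}[E - V(x)]^{1/2}\,dx = \left(n + \frac{1}{2}\right)\pi + O\left(\frac{1}{n}\right) \tag{3.7.8}$$

or

$$\int_{a_E}^{b_E}\sqrt{2m(E - V(x))}\,dx = \left(n + \frac{1}{2}\right)\pi\hbar + O\left(\frac{1}{n}\right). \tag{3.7.9}$$

Thus the condition

$$\int_{a_E}^{b_E}\sqrt{2m(E - V(x))}\,dx = \left(n + \frac{1}{2}\right)\pi\hbar, \quad n = 1, 2, \ldots \tag{3.7.10}$$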

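determines approximately the allowed energy levels.

As an example consider the quantized harmonic oscillator. $V(x) = 2\pi^2\nu^2mx^2$
in (3.2.2) so that $V(x) = E \iff x = \pm\frac{1}{\omega}\sqrt{\frac{2E}{m}}$ for $\omega \overset{\text{def}}{=} 2\pi\nu$. That is, $a_E = -\frac{1}{\omega}\sqrt{\frac{2E}{m}}$, $b_E = \frac{1}{\omega}\sqrt{\frac{2E}{m}}$, and

$$\int_{a_E}^{b_E}\sqrt{2m(E - V(x))}\,dx = m\omega\int_{-b_E}^{b_E}\sqrt{b_E^2 - x^2}\,dx = 2m\omega\int_{0}^{b_E}\sqrt{b_E^2 - x^2}\,dx. \tag{3.7.11}$$

Since

$$\int\sqrt{a^2 - x^2}\,dx = \frac{x}{2}\sqrt{a^2 - x^2} + \frac{a^2}{2}\arcsin\frac{x}{a} \implies \int_0^a\sqrt{a^2 - x^2}\,dx = \frac{\pi a^2}{4} \tag{3.7.12}$$

for $a > 0$, the integral in (3.7.11) equals $2m\omega\cdot\frac{\pi b_E^2}{4} = \frac{\pi E}{\omega}$. Condition (3.7.10) therefore means that $E = (n + \frac{1}{2})\omega\hbar = (n + \frac{1}{2})2\pi\nu\hbar$,
so that we obtain precisely the energy levels $\{E_n\}_{n=0}^{\infty}$ mentioned following equation
(3.2.3), apart from $E_0$.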
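The harmonic oscillator computation can be reproduced by direct numerical integration of the left-hand side of (3.7.10). The sketch below assumes illustrative units $m = \omega = \hbar = 1$ and a simple midpoint rule; neither choice comes from the text.

```python
import math

def action_integral(E, m=1.0, omega=1.0, steps=200_000):
    """Midpoint-rule value of the integral in (3.7.10) for V(x) = m*omega^2*x^2/2."""
    b = math.sqrt(2 * E / m) / omega          # turning point b_E; a_E = -b_E
    dx = 2 * b / steps
    total = 0.0
    for i in range(steps):
        x = -b + (i + 0.5) * dx
        # max(..., 0) guards against tiny negative values from rounding
        total += math.sqrt(max(2 * m * (E - 0.5 * m * omega**2 * x * x), 0.0))
    return total * dx

hbar = 1.0   # illustrative units (assumption)
for n in range(4):
    E = (n + 0.5) * 1.0 * hbar                # E_n = (n + 1/2)*omega*hbar
    print(n, action_integral(E) / (math.pi * hbar))   # compare with n + 1/2
```

Each printed ratio can be compared with $n + \frac{1}{2}$, the right-hand side of (3.7.10) divided by $\pi\hbar$.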

The condition (3.7.10) also amounts to the WKB (Wentzel, Kramers, Brillouin)
eigenvalue approximation, for the classical region $R_E \overset{\text{def}}{=} \{x \in \mathbb{R} \mid V(x) \leq E\}$,
where the solutions $a_E, b_E$ of $V(x) = E$ are called turning points; see Figure 3.4.

Classically the particle of momentum $p = mv$ has kinetic energy $\frac{p^2}{2m} = T =
E - V$, and is confined to $R_E$ in its motion. That is, $p = \sqrt{2m(E - V)}$ and one
also writes (3.7.10) as

$$\int_{a_E}^{b_E} p\,dx = \left(n + \frac{1}{2}\right)\pi\hbar. \tag{3.7.13}$$



For the special potentials $q(x) = Ax^{2M}$ above, with $A > 0$, $M = 1, 2, 3, \ldots$,
the differential operator $D = -\frac{d^2}{dx^2} + q(x)$ in (3.7.2) has the following special
properties, as shown for example by Titchmarsh [81]. $D$ is positive and has a
compact resolvent. Thus $D$ has a discrete spectrum $\{\lambda_j, n_j\}_{j=0}^{\infty}$ where $n_j$ is the
multiplicity of the eigenvalue $\lambda_j$; in fact each $n_j = 1$. $0 < \lambda_0 < \lambda_1 < \lambda_2 < \cdots$;
$\lim_{k \to \infty}\lambda_k = \infty$. The $\lambda_j$ depend on $M$ of course and we sometimes write $\lambda_j =
\lambda_j(M)$. One has (cf. (3.7.5))

$$\int_{-a^{1/(2M)}}^{a^{1/(2M)}}\left[\lambda_n - Ax^{2M}\right]^{1/2}dx = \left(n + \frac{1}{2}\right)\pi + O\left(\frac{1}{n}\right), \quad \text{where } a \overset{\text{def}}{=} \frac{\lambda_n}{A}. \tag{3.7.14}$$

We shall compute the integral in (3.7.14) and derive the following spectral
asymptotics.

Theorem 3.2 (Titchmarsh).

$$\lambda_n(M) \sim \left[\frac{(2n + 1)\sqrt{\pi}\,M A^{1/(2M)}\,\Gamma\left(\frac{1}{2M} + \frac{3}{2}\right)}{\Gamma\left(\frac{1}{2M}\right)}\right]^{\frac{2M}{M+1}} \quad \text{as } n \to \infty. \tag{3.7.15}$$

Figure 3.4: Turning points for the classical region $R_E$



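First, for $a > 0$, $n = 1, 2, 3, \ldots$ let

$$I = I_n(a) \overset{\text{def}}{=} \int_{-a^{1/n}}^{a^{1/n}}\sqrt{a^2 - x^{2n}}\,dx. \tag{3.7.16}$$

With the change of variables $x = a^{1/n}t^{1/(2n)}$ on $[0, a^{1/n}]$,

$$I = 2\int_0^{a^{1/n}}\sqrt{a^2 - x^{2n}}\,dx = \frac{a^{(n+1)/n}}{n}\int_0^1 t^{\frac{1}{2n} - 1}\sqrt{1 - t}\,dt = \frac{a^{(n+1)/n}}{n}B\left(\frac{1}{2n}, \frac{3}{2}\right) = \frac{a^{(n+1)/n}}{n}\,\frac{\Gamma\left(\frac{1}{2n}\right)\Gamma\left(\frac{3}{2}\right)}{\Gamma\left(\frac{1}{2n} + \frac{3}{2}\right)}, \tag{3.7.17}$$

where $B(p, q) \overset{\text{def}}{=} \frac{\Gamma(p)\Gamma(q)}{\Gamma(p + q)}$ is the Beta function and $\Gamma$ is the Gamma function:

$$\Gamma(x) = \int_0^{\infty}t^{x - 1}e^{-t}\,dt \quad \text{for } x > 0. \tag{3.7.18}$$

Now $\Gamma\left(\frac{3}{2}\right) = \Gamma\left(1 + \frac{1}{2}\right) = \frac{1}{2}\Gamma\left(\frac{1}{2}\right) = \frac{\sqrt{\pi}}{2}$. Hence

Proposition 3.3. For $a > 0$, $n = 1, 2, 3, \ldots$

$$\int_{-a^{1/n}}^{a^{1/n}}\sqrt{a^2 - x^{2n}}\,dx = 2\int_0^{a^{1/n}}\sqrt{a^2 - x^{2n}}\,dx = \frac{a^{(n+1)/n}\sqrt{\pi}\,\Gamma\left(\frac{1}{2n}\right)}{2n\,\Gamma\left(\frac{1}{2n} + \frac{3}{2}\right)}. \tag{3.7.19}$$

For $n = 1$ we obtain the elementary calculus result

$$\int_0^a\sqrt{a^2 - x^2}\,dx = \frac{\pi a^2}{4} \tag{3.7.20}$$

of (3.7.12).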
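Proposition 3.3 lends itself to a quick numerical sanity check. The sketch below compares the closed form on the right of (3.7.19), evaluated with the standard library's `math.gamma`, against a midpoint-rule approximation of the integral; the step count and sample values of $a$ and $n$ are arbitrary choices.

```python
import math

def closed_form(a, n):
    """Right-hand side of (3.7.19)."""
    return (a ** ((n + 1) / n) * math.sqrt(math.pi) * math.gamma(1 / (2 * n))
            / (2 * n * math.gamma(1 / (2 * n) + 1.5)))

def numeric(a, n, steps=200_000):
    """Midpoint rule for the integral of sqrt(a^2 - x^(2n)) over [-a^(1/n), a^(1/n)]."""
    b = a ** (1 / n)
    dx = 2 * b / steps
    total = 0.0
    for i in range(steps):
        x = -b + (i + 0.5) * dx
        total += math.sqrt(max(a * a - x ** (2 * n), 0.0))
    return total * dx

for a, n in ((1.0, 1), (1.0, 2), (2.5, 3)):
    print(a, n, closed_form(a, n), numeric(a, n))
```

For $n = 1$ the closed form reduces to $\pi a^2/2$, twice the value in (3.7.12), as expected for the integral over the full interval $[-a, a]$.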

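Corollary 3.1. For $\lambda, A > 0$, $M = 1, 2, 3, \ldots$ let

$$I_M(\lambda) \overset{\text{def}}{=} \int_{-\left(\frac{\lambda}{A}\right)^{1/(2M)}}^{\left(\frac{\lambda}{A}\right)^{1/(2M)}}\left[\lambda - Ax^{2M}\right]^{1/2}dx. \tag{3.7.21}$$

Then

$$I_M(\lambda) = A^{-1/(2M)}\,\frac{\sqrt{\pi}\,\Gamma\left(\frac{1}{2M}\right)}{2M\,\Gamma\left(\frac{1}{2M} + \frac{3}{2}\right)}\,\lambda^{\frac{M+1}{2M}}. \tag{3.7.22}$$

Proof. By the change of variables $t = A^{1/(2M)}x$, $I_M(\lambda) = A^{-1/(2M)}I_M(\lambda^{1/2})$ for $I_M(\lambda^{1/2})$
given by (3.7.16) (with $n = M$, $a = \lambda^{1/2}$). Thus Corollary 3.1 follows from Proposition 3.3. □

By Corollary 3.1 and equation (3.7.14)

$$A^{-1/(2M)}\,\frac{\sqrt{\pi}\,\Gamma\left(\frac{1}{2M}\right)}{2M\,\Gamma\left(\frac{1}{2M} + \frac{3}{2}\right)}\,\lambda_n^{\frac{M+1}{2M}} = \left(n + \frac{1}{2}\right)\pi + O\left(\frac{1}{n}\right). \tag{3.7.23}$$

Let

$$C_M \overset{\text{def}}{=} \frac{A^{-1/(2M)}\sqrt{\pi}\,\Gamma\left(\frac{1}{2M}\right)}{2M\,\Gamma\left(\frac{1}{2M} + \frac{3}{2}\right)} > 0, \tag{3.7.24}$$

so that for some natural number $n_0$ and $K > 0$

$$\left|C_M\lambda_n^{\frac{M+1}{2M}} - \left(n + \frac{1}{2}\right)\pi\right| < \frac{K}{n} \ \text{ for } n \geq n_0 \implies \left|\frac{C_M\lambda_n^{\frac{M+1}{2M}}}{\left(n + \frac{1}{2}\right)\pi} - 1\right| < \frac{K}{n\left(n + \frac{1}{2}\right)\pi} \tag{3.7.25}$$

for $n \geq n_0$.

Let $n \to \infty$ and conclude that

$$\lim_{n \to \infty}\frac{C_M\lambda_n^{\frac{M+1}{2M}}}{\left(n + \frac{1}{2}\right)\pi} = 1. \tag{3.7.26}$$

That is,

$$\lim_{n \to \infty}\left[\frac{C_M}{\left(n + \frac{1}{2}\right)\pi}\right]^{\frac{2M}{M+1}}\lambda_n = 1, \tag{3.7.27}$$

which proves Theorem 3.2 since

$$\frac{\left(n + \frac{1}{2}\right)\pi}{C_M} = \frac{(2n + 1)\sqrt{\pi}\,M A^{1/(2M)}\,\Gamma\left(\frac{1}{2M} + \frac{3}{2}\right)}{\Gamma\left(\frac{1}{2M}\right)}. \tag{3.7.28}$$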
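The leading-order estimate behind Theorem 3.2 is easy to evaluate. The following sketch (function names are illustrative, not from the text) solves $C_M\lambda^{(M+1)/(2M)} = (n + \frac{1}{2})\pi$ for $\lambda$, with $C_M$ as in (3.7.24). It only evaluates the asymptotic expression; it does not compute true eigenvalues of $D$.

```python
import math

def c_m(M, A=1.0):
    """C_M of (3.7.24)."""
    return (A ** (-1 / (2 * M)) * math.sqrt(math.pi) * math.gamma(1 / (2 * M))
            / (2 * M * math.gamma(1 / (2 * M) + 1.5)))

def lambda_asymptotic(n, M, A=1.0):
    """Leading-order estimate of lambda_n(M): solve C_M * lam^((M+1)/(2M)) = (n + 1/2)*pi."""
    return ((n + 0.5) * math.pi / c_m(M, A)) ** (2 * M / (M + 1))

def bracket_form(n, M, A=1.0):
    """The same estimate written as in (3.7.28), raised to the power 2M/(M+1)."""
    base = ((2 * n + 1) * math.sqrt(math.pi) * M * A ** (1 / (2 * M))
            * math.gamma(1 / (2 * M) + 1.5) / math.gamma(1 / (2 * M)))
    return base ** (2 * M / (M + 1))

for n in range(4):
    print(n, lambda_asymptotic(n, 1), lambda_asymptotic(n, 2))
```

For $M = 1$, $A = 1$ the estimate returns $2n + 1$, the exact harmonic oscillator eigenvalues; for $M = 2$ it gives only the asymptotic approximation for the quartic potential.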



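Given the discrete spectrum $\{\lambda_j(M), n_j\}$ of $D = -\frac{d^2}{dx^2} + Ax^{2M}$, $A > 0$, with

$$0 < \lambda_0 < \lambda_1 < \cdots, \quad \lim_{k \to \infty}\lambda_k = \infty, \quad \lambda_j = \lambda_j(M), \tag{3.7.29}$$

we form the corresponding spectral zeta function

$$\zeta_D(s) = \zeta_M(s) \overset{\text{def}}{=} \sum_{j=0}^{\infty}\frac{n_j}{\lambda_j^s} = \sum_{j=0}^{\infty}\frac{1}{\lambda_j^s} \tag{3.7.30}$$

as we have noted that each $n_j = 1$. Using the spectral asymptotics of Theorem 3.2
we easily deduce

Corollary 3.2. The zeta function (3.7.30) converges absolutely for $\operatorname{Re} s > \frac{M+1}{2M}$
and is holomorphic on this half-plane.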

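Proof. We have noted that Theorem 3.2 is expressed by equation (3.7.27): as
$n \to \infty$, for $\alpha \overset{\text{def}}{=} \alpha(M) \overset{\text{def}}{=} 2M/(M + 1)$,

$$\left[\frac{(n + 1/2)\pi}{C_M}\right]^{\alpha}\frac{1}{\lambda_n} = \left[\frac{(2n + 1)\pi}{2C_M}\right]^{\alpha}\frac{1}{\lambda_n} \to 1 \implies \frac{(2n + 1)^{\alpha}}{\lambda_n} \to \left(\frac{2C_M}{\pi}\right)^{\alpha} \overset{\text{def}}{=} c_M. \tag{3.7.31}$$

Thus for $\epsilon = 1$, choose a natural number $N$ such that $n > N \implies$

$$\left|\frac{(2n + 1)^{\alpha}}{\lambda_n} - c_M\right| < 1 \implies \frac{(2n + 1)^{\alpha}}{\lambda_n} < 1 + c_M \implies \frac{1}{\lambda_n} < \frac{1 + c_M}{(2n + 1)^{\alpha}} \implies \left|\frac{1}{\lambda_n^s}\right| < \frac{(1 + c_M)^{\operatorname{Re} s}}{(2n + 1)^{\alpha\operatorname{Re} s}} \tag{3.7.32}$$

for $\operatorname{Re} s > 0$, where

$$\sum_{n=0}^{\infty}\frac{1}{(2n + 1)^{\alpha\operatorname{Re} s}} < \infty \quad \text{for } \alpha\operatorname{Re} s > 1. \tag{3.7.33}$$

Thus

$$\sum_{n=0}^{\infty}\frac{1}{\lambda_n^s} \tag{3.7.34}$$

also converges absolutely for $\alpha\operatorname{Re} s > 1$, i.e., for $\operatorname{Re} s > \frac{M+1}{2M}$, as asserted in Corollary 3.2, since $\alpha = 2M/(M + 1)$.
To see that $\zeta_M(s)$ is holomorphic on the domain $S_M \overset{\text{def}}{=} \{s \in \mathbb{C} \mid \operatorname{Re} s > \frac{M+1}{2M}\}$
it suffices to show, by Weierstrass' theorem, that the series $\sum_{n=0}^{\infty}\frac{1}{\lambda_n^s}$ converges
uniformly on compact subsets of $S_M$. In fact a little bit more is true: If $\delta > 0$ is
arbitrary, then the series $\sum_{n=0}^{\infty}\frac{1}{\lambda_n^s}$ converges uniformly (and absolutely) on $\operatorname{Re} s \geq
\frac{M+1}{2M} + \delta$. Namely, on this domain we have for $n$ sufficiently large that
$\lambda_n > 1$ (see (3.7.29)), and hence $|\lambda_n^s| = \lambda_n^{\operatorname{Re} s} \geq \lambda_n^{\frac{1}{\alpha} + \delta}$. That is,

$$\left|\frac{1}{\lambda_n^s}\right| \leq \frac{1}{\lambda_n^{\frac{1}{\alpha} + \delta}} \tag{3.7.35}$$

where

$$\sum_{n=0}^{\infty}\frac{1}{\lambda_n^{\frac{1}{\alpha} + \delta}} < \infty, \tag{3.7.36}$$

as $\frac{1}{\alpha} + \delta > \frac{1}{\alpha} = \frac{M+1}{2M}$. The M-test therefore applies. □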



We remark that for $M = 1$, $A = 1$, i.e.,

$$D = -\frac{d^2}{dx^2} + x^2, \tag{3.7.37}$$

the eigenvalues are given by $\lambda_n = 2n + 1$, $n = 0, 1, 2, \ldots$, as $D$ is essentially the
quantized Hamiltonian of (3.1.2) for the harmonic oscillator; cf. (3.2.2), (3.2.3);
then clearly (3.7.29) holds. In this case one has in fact equality in the asymptotic
assertion of Theorem 3.2: The right-hand side there is (for $M, A = 1$)

$$\frac{\sqrt{\pi}\,(2n + 1)\,\Gamma(1/2 + 3/2)}{\Gamma(1/2)} = (2n + 1)\Gamma(2) = 2n + 1. \tag{3.7.38}$$

Also the zeta function $\zeta_D(s)$ of (3.7.30) (which converges absolutely for $\operatorname{Re} s > 1$,
by Corollary 3.2) is given by

$$\zeta_D(s) = \left(1 - 2^{-s}\right)\zeta(s) \quad \text{where} \quad \zeta(s) \overset{\text{def}}{=} \sum_{n=1}^{\infty}\frac{1}{n^s} \tag{3.7.39}$$

is the Riemann zeta function. Namely

$$\begin{aligned}
\zeta_D(s) \overset{\text{def}}{=} \sum_{n=0}^{\infty}\frac{1}{(2n + 1)^s} &= 1 + \frac{1}{3^s} + \frac{1}{5^s} + \frac{1}{7^s} + \cdots\\
&= \left(1 + \frac{1}{2^s} + \frac{1}{3^s} + \frac{1}{4^s} + \frac{1}{5^s} + \frac{1}{6^s} + \cdots\right) - \left(\frac{1}{2^s} + \frac{1}{4^s} + \frac{1}{6^s} + \frac{1}{8^s} + \cdots\right)\\
&= \zeta(s) - 2^{-s}\zeta(s)
\end{aligned} \tag{3.7.40}$$

as desired. In particular we can use (3.7.39), and the theory of Riemann's zeta
function, to meromorphically continue $\zeta_D$ (for $D = -\frac{d^2}{dx^2} + x^2$) to all of $\mathbb{C}$.

For $M \geq 2$ the eigenvalues $\lambda_n(M)$ are unknown. However, even for $M \geq 2$ the
zeta function $\zeta_D$ (for $D = -\frac{d^2}{dx^2} + Ax^{2M}$) does admit a meromorphic continuation
to $\mathbb{C}$. We shall return to this zeta function, and others, in later chapters. For
example see Chapter 15.
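Identity (3.7.39) can be illustrated numerically at $s = 2$, where $\zeta(2) = \pi^2/6$ is known in closed form. The sketch below simply sums the series over the odd integers $\lambda_n = 2n + 1$; the truncation length is an arbitrary choice.

```python
import math

def zeta_d_partial(s, terms):
    """Partial sum of the spectral zeta function for lambda_n = 2n + 1."""
    return sum((2 * n + 1) ** (-s) for n in range(terms))

s = 2
approx = zeta_d_partial(s, 200_000)
exact = (1 - 2 ** (-s)) * math.pi ** 2 / 6    # (1 - 2^{-s}) * zeta(2)
print(approx, exact)
```

The partial sum approaches $(1 - 2^{-2})\,\pi^2/6 = \pi^2/8$ from below, with a tail of order $1/(4N)$ after $N$ terms.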



3.8 The Diffeomorphism $\Phi$ Again

In Section 2.8 of Chapter 2 we constructed a diffeomorphism $\Phi$ of $\mathbb{R}^6$ which
provided passage from rectangular coordinates $(x_1, y_1, z_1, x_2, y_2, z_2)$ to Jacobi, or
center of mass coordinates, $(x_1 - x_2, y_1 - y_2, z_1 - z_2, R_1, R_2, R_3)$ where $R_1, R_2, R_3$
are given by (2.8.1) for fixed masses $m_1, m_2$. The transformation of a “scaled”
Laplacian, via $\Phi$, was given by formula (2.8.17). We now apply that formula in
connection with the time independent Schrödinger equation.

Let $U = \{(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6 \mid (x_1, y_1, z_1) \neq (x_2, y_2, z_2)\}$ as in Section 2.5
of Chapter 2. For $p = (x_1, y_1, z_1, x_2, y_2, z_2) \in U$, $\Phi(p) \neq (0, 0, 0,
x_2', y_2', z_2')$; otherwise $x_1 - x_2 = 0$, $y_1 - y_2 = 0$, $z_1 - z_2 = 0 \implies (x_1, y_1, z_1) =
(x_2, y_2, z_2)$. That is, $\Phi(U) \subset (\mathbb{R}^3 - \{0\}) \times \mathbb{R}^3$. Conversely let $a \in (\mathbb{R}^3 - \{0\}) \times \mathbb{R}^3$, $a =
(a_1, a_2, a_3, a_4, a_5, a_6)$ with $(a_1, a_2, a_3) \neq (0, 0, 0)$. Write $a = \Phi(x)$, $x = (x_1, y_1, z_1,
x_2, y_2, z_2) \in \mathbb{R}^6$. By Corollary 2.1

$$\begin{aligned}
x_1 &= \frac{m_2a_1}{m_1 + m_2} + a_4, & x_2 &= x_1 - a_1,\\
y_1 &= \frac{m_2a_2}{m_1 + m_2} + a_5, & y_2 &= y_1 - a_2,\\
z_1 &= \frac{m_2a_3}{m_1 + m_2} + a_6, & z_2 &= z_1 - a_3.
\end{aligned} \tag{3.8.1}$$

Then $(x_1, y_1, z_1) \neq (x_2, y_2, z_2)$; otherwise $a_1 = a_2 = a_3 = 0$; that is, $x \in U$:

Proposition 3.4. $\Phi(U) = \{(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6 \mid (x_1, y_1, z_1) \neq (0, 0, 0)\} =
(\mathbb{R}^3 - \{0\}) \times \mathbb{R}^3$.

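Recall the potential $V : U \to \mathbb{R}$ given in equation (2.5.4):

$$V(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1q_2}{\sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}}. \tag{3.8.2}$$

By (3.8.1)

$$\Phi^{-1}(x_1, y_1, z_1, x_2, y_2, z_2) = \left(\frac{m_2x_1}{m_1 + m_2} + x_2,\ \frac{m_2y_1}{m_1 + m_2} + y_2,\ \frac{m_2z_1}{m_1 + m_2} + z_2,\ \frac{m_2x_1}{m_1 + m_2} + x_2 - x_1,\ \frac{m_2y_1}{m_1 + m_2} + y_2 - y_1,\ \frac{m_2z_1}{m_1 + m_2} + z_2 - z_1\right); \text{ hence} \tag{3.8.3}$$

$$(V \circ \Phi^{-1})(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1q_2}{\sqrt{x_1^2 + y_1^2 + z_1^2}} \quad \text{on } \Phi(U). \tag{3.8.4}$$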
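Formulas (3.8.1) to (3.8.4) can be exercised on a sample point. In the sketch below the forward map `phi` is an assumption: it takes $(R_1, R_2, R_3)$ to be the center of mass $(m_1x^{(1)} + m_2x^{(2)})/(m_1 + m_2)$, which is what (3.8.1) implies but is not quoted from (2.8.1). The masses, charges, and test point are arbitrary.

```python
import math

def phi(p, m1, m2):
    """Relative and center-of-mass coordinates (assumed form of the map Phi)."""
    x1, y1, z1, x2, y2, z2 = p
    total = m1 + m2
    return (x1 - x2, y1 - y2, z1 - z2,
            (m1 * x1 + m2 * x2) / total,
            (m1 * y1 + m2 * y2) / total,
            (m1 * z1 + m2 * z2) / total)

def phi_inverse(a, m1, m2):
    """Inverse map, formula (3.8.1)."""
    a1, a2, a3, a4, a5, a6 = a
    total = m1 + m2
    x1 = m2 * a1 / total + a4
    y1 = m2 * a2 / total + a5
    z1 = m2 * a3 / total + a6
    return (x1, y1, z1, x1 - a1, y1 - a2, z1 - a3)

def coulomb(p, q1q2=1.0):
    """V of (3.8.2)."""
    return q1q2 / math.dist(p[:3], p[3:])

m1, m2 = 1.0, 1836.0
p = (1.0, 2.0, -0.5, 0.3, -1.2, 0.8)
a = phi(p, m1, m2)
print(phi_inverse(a, m1, m2))                             # should reproduce p
print(coulomb(phi_inverse(a, m1, m2)), 1.0 / math.hypot(*a[:3]))   # both sides of (3.8.4)
```

The second printed pair shows that $V \circ \Phi^{-1}$ depends only on the first three (relative) coordinates, which is what makes the separation of variables in the rest of this section possible.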

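$U, \Phi(U) \subset \mathbb{R}^6$ are open and $\Phi : U \to \Phi(U)$ is a diffeomorphism. Thus if $f : \Phi(U) \to \mathbb{R}$ has
continuous first and second partial derivatives, and

$$\nabla_j^2 \overset{\text{def}}{=} \frac{\partial^2}{\partial x_j^2} + \frac{\partial^2}{\partial y_j^2} + \frac{\partial^2}{\partial z_j^2}, \quad j = 1, 2, \tag{3.8.5}$$

then formula (2.8.17) can be written as

$$\frac{1}{m_1}\left(\nabla_1^2F\right)(a) + \frac{1}{m_2}\left(\nabla_2^2F\right)(a) = \left(\frac{1}{m_1} + \frac{1}{m_2}\right)\left(\nabla_1^2f\right)(\Phi(a)) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f\right)(\Phi(a)) \tag{3.8.6}$$

for $a \in U$, for $F = f \circ \Phi$ on $U$.

Suppose, for example, that $\psi : \mathbb{R}^6 \to \mathbb{C}$ is continuous and $\psi|_U$ has continuous
first and second partial derivatives. Define $f = \psi \circ \Phi^{-1} : \mathbb{R}^6 \to \mathbb{C}$ so that $f$ is
continuous.

As $\Phi^{-1} : \Phi(U) \to U$ is $C^{\infty}$ and $\psi : U \to \mathbb{C}$ is class $C^2$, $f = \psi \circ \Phi^{-1} : \Phi(U) \to
\mathbb{C}$ is class $C^2$. Also $\Phi^{-1}(\Phi(a)) = a$ on $U \implies F = \psi$ on $U$. For $\mu \overset{\text{def}}{=} \frac{m_1m_2}{m_1 + m_2}$ (i.e.,
$\frac{1}{\mu} = \frac{1}{m_1} + \frac{1}{m_2}$) we therefore have

$$\frac{1}{m_1}\left(\nabla_1^2\psi\right)(a) + \frac{1}{m_2}\left(\nabla_2^2\psi\right)(a) = \frac{1}{\mu}\left(\nabla_1^2(\psi \circ \Phi^{-1})\right)(\Phi(a)) + \frac{1}{m_1 + m_2}\left(\nabla_2^2(\psi \circ \Phi^{-1})\right)(\Phi(a)) \tag{3.8.7}$$

for $a \in U$. $\mu$ is called the reduced mass. The time-independent Schrödinger
equation (3.4.10) for $\psi$ can therefore be written (where we take $x = \Phi(a)$)

$$\frac{1}{\mu}\left(\nabla_1^2(\psi \circ \Phi^{-1})\right)(x) + \frac{1}{m_1 + m_2}\left(\nabla_2^2(\psi \circ \Phi^{-1})\right)(x) + \frac{8\pi^2}{h^2}\left(E - (V \circ \Phi^{-1})(x)\right)(\psi \circ \Phi^{-1})(x) = 0 \tag{3.8.8}$$

on $\Phi(U) = (\mathbb{R}^3 - \{0\}) \times \mathbb{R}^3$ where (by (3.8.4))

$$(V \circ \Phi^{-1})(x) = \frac{q_1q_2}{\left|x^{(1)}\right|} \quad \text{on } \Phi(U). \tag{3.8.9}$$

We shall write $x = (x_1, y_1, z_1, x_2, y_2, z_2) = (x^{(1)}, x^{(2)}) \in \mathbb{R}^3 \times \mathbb{R}^3$ for $x \in \mathbb{R}^6$.
Assume $f \neq 0$ has the form $f = f_1 \otimes f_2$ for $f_1, f_2 : \mathbb{R}^3 \to \mathbb{C}$ continuous, where
$f_1, f_2$ have continuous first and second partial derivatives on $\mathbb{R}^3 - \{0\}$, $\mathbb{R}^3$ respectively:

$$\begin{aligned}
f(x) &= f_1(x^{(1)})f_2(x^{(2)}),\\
\left(\nabla_1^2f\right)(x) &= \left(\nabla_1^2f_1\right)(x^{(1)})f_2(x^{(2)}),\\
\left(\nabla_2^2f\right)(x) &= f_1(x^{(1)})\left(\nabla_2^2f_2\right)(x^{(2)})
\end{aligned} \tag{3.8.10}$$

for $x \in \Phi(U)$, i.e., $x^{(1)} \in \mathbb{R}^3 - \{0\}$.

On $\Phi(U)$ Schrödinger's equation becomes

$$\frac{1}{\mu}\left(\nabla_1^2f_1\right)(x^{(1)})f_2(x^{(2)}) + \frac{8\pi^2}{h^2}\left(E - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)})f_2(x^{(2)}) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f_2\right)(x^{(2)})f_1(x^{(1)}) = 0. \tag{3.8.11}$$

As $f_2 \neq 0$ choose and fix some $x^{(2)} \in \mathbb{R}^3$ such that $f_2(x^{(2)}) \neq 0$. Divide by
$f_2(x^{(2)})$ to get $\forall\, x^{(1)} \in \mathbb{R}^3 - \{0\}$,

$$0 = \frac{1}{\mu}\left(\nabla_1^2f_1\right)(x^{(1)}) + \frac{8\pi^2}{h^2}\left(E - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)}) + C_0f_1(x^{(1)}) \tag{3.8.12}$$

where

$$C_0 \overset{\text{def}}{=} \frac{1}{m_1 + m_2}\,\frac{\left(\nabla_2^2f_2\right)(x^{(2)})}{f_2(x^{(2)})} \tag{3.8.13}$$

is a constant (independent of $x^{(1)}$):

$$\left(\nabla_1^2f_1\right)(x^{(1)}) + \frac{8\pi^2\mu}{h^2}\left(E + C - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)}) = 0 \tag{3.8.14}$$

on $\mathbb{R}^3 - \{0\}$ for $C \overset{\text{def}}{=} \frac{h^2}{8\pi^2}C_0$.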

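Similarly $f_1 \neq 0$ on $\mathbb{R}^3 \implies f_1 \neq 0$ on $\mathbb{R}^3 - \{0\}$ by continuity $\implies \exists\, x^{(1)} \in
\mathbb{R}^3 - \{0\}$ (which we fix) such that $f_1(x^{(1)}) \neq 0$. Divide (3.8.11) by $f_1(x^{(1)})$:

$$\frac{1}{\mu}\,\frac{\left(\nabla_1^2f_1\right)(x^{(1)})}{f_1(x^{(1)})}f_2(x^{(2)}) + \frac{8\pi^2}{h^2}\left(E - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_2(x^{(2)}) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f_2\right)(x^{(2)}) = 0 \text{ on } \mathbb{R}^3: \tag{3.8.15}$$

$$\left(\nabla_2^2f_2\right)(x^{(2)}) + (m_1 + m_2)\frac{8\pi^2}{h^2}\left[E - \frac{q_1q_2}{\left|x^{(1)}\right|} + \frac{h^2}{8\pi^2}\,\frac{1}{\mu}\,\frac{\left(\nabla_1^2f_1\right)(x^{(1)})}{f_1(x^{(1)})}\right]f_2(x^{(2)}) = 0,$$

where the bracketed quantity is a constant (independent of $x^{(2)}$), or

$$\left(\nabla_2^2f_2\right)(x^{(2)}) + \frac{8\pi^2}{h^2}(m_1 + m_2)E_2f_2(x^{(2)}) = 0$$

on $\mathbb{R}^3$ for

$$E_2 \overset{\text{def}}{=} E - \frac{q_1q_2}{\left|x^{(1)}\right|} + \frac{h^2}{8\pi^2\mu}\,\frac{\left(\nabla_1^2f_1\right)(x^{(1)})}{f_1(x^{(1)})}.$$

Let

$$E_1 \overset{\text{def}}{=} E + C = E + \frac{h^2}{8\pi^2}C_0 = E + \frac{h^2}{8\pi^2}\,\frac{1}{m_1 + m_2}\,\frac{\left(\nabla_2^2f_2\right)(x^{(2)})}{f_2(x^{(2)})}.$$

Then we note that

$$\begin{aligned}
f_1(x^{(1)})f_2(x^{(2)})(E_1 + E_2)
&= f_1(x^{(1)})f_2(x^{(2)})\left[\frac{h^2}{8\pi^2}\,\frac{1}{m_1 + m_2}\,\frac{\left(\nabla_2^2f_2\right)(x^{(2)})}{f_2(x^{(2)})} + E + E - \frac{q_1q_2}{\left|x^{(1)}\right|} + \frac{h^2}{8\pi^2\mu}\,\frac{\left(\nabla_1^2f_1\right)(x^{(1)})}{f_1(x^{(1)})}\right]\\
&= \frac{h^2}{8\pi^2}\left[\frac{1}{m_1 + m_2}\left(\nabla_2^2f_2\right)(x^{(2)})f_1(x^{(1)}) + \frac{8\pi^2}{h^2}\left(E - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)})f_2(x^{(2)}) + \frac{1}{\mu}\left(\nabla_1^2f_1\right)(x^{(1)})f_2(x^{(2)})\right] + f_1(x^{(1)})f_2(x^{(2)})E\\
&= f_1(x^{(1)})f_2(x^{(2)})E,
\end{aligned}$$

by equation (3.8.11), so that $E_1 + E_2 = E$.

Theorem 3.3. Under the separation of variables $f = f_1 \otimes f_2$ with $f$ a solution
of the time-independent Schrödinger equation

$$\frac{1}{\mu}\left(\nabla_1^2f\right)(x) + \frac{8\pi^2}{h^2}\left(E - (V \circ \Phi^{-1})(x)\right)f(x) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f\right)(x) = 0 \tag{3.8.21}$$

on $\Phi(U)$, we can find constants $E_1, E_2$ such that $E = E_1 + E_2$ and such that

$$\left(\nabla_1^2f_1\right)(x^{(1)}) + \frac{8\pi^2\mu}{h^2}\left(E_1 - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)}) = 0 \tag{3.8.22}$$

for $x^{(1)} = (x_1, y_1, z_1) \in \mathbb{R}^3 - \{0\}$,

$$\left(\nabla_2^2f_2\right)(x^{(2)}) + \frac{8\pi^2}{h^2}(m_1 + m_2)E_2f_2(x^{(2)}) = 0 \tag{3.8.23}$$

for $x^{(2)} = (x_2, y_2, z_2) \in \mathbb{R}^3$. Here

$$V(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1q_2}{\sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}} \tag{3.8.24}$$

on $U = \{(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6 \mid (x_1, y_1, z_1) \neq (x_2, y_2, z_2)\} \implies$

$$(V \circ \Phi^{-1})(x_1, y_1, z_1, x_2, y_2, z_2) = \frac{q_1q_2}{\left|x^{(1)}\right|}, \quad \left|x^{(1)}\right| = \sqrt{x_1^2 + y_1^2 + z_1^2}, \tag{3.8.25}$$

on $\Phi(U) = \{(x_1, y_1, z_1, x_2, y_2, z_2) \in \mathbb{R}^6 \mid (x_1, y_1, z_1) \neq (0, 0, 0)\}$. $\mu \overset{\text{def}}{=} \frac{m_1m_2}{m_1 + m_2}$.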

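Conversely, let $f_1, f_2 : \mathbb{R}^3 \to \mathbb{C}$ (continuous) be given with continuous first
and second partial derivatives on $\mathbb{R}^3 - \{0\}$, $\mathbb{R}^3$ respectively. Let $E_1, E_2$ be constants such that

$$\left(\nabla_1^2f_1\right)(x^{(1)}) + \frac{8\pi^2\mu}{h^2}\left(E_1 - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)}) = 0 \tag{3.8.26}$$

on $\mathbb{R}^3 - \{0\}$,

$$\left(\nabla_2^2f_2\right)(x^{(2)}) + \frac{8\pi^2}{h^2}(m_1 + m_2)E_2f_2(x^{(2)}) = 0 \tag{3.8.27}$$

on $\mathbb{R}^3$.

Multiply the first equation by $\frac{1}{\mu}f_2(x^{(2)})$ and the second equation by $\frac{1}{m_1 + m_2}f_1(x^{(1)})$:

$$\frac{1}{\mu}\left(\nabla_1^2f_1\right)(x^{(1)})f_2(x^{(2)}) + \frac{8\pi^2}{h^2}\left(E_1 - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)})f_2(x^{(2)}) = 0 \tag{3.8.28}$$

and

$$\frac{1}{m_1 + m_2}\left(\nabla_2^2f_2\right)(x^{(2)})f_1(x^{(1)}) + \frac{8\pi^2}{h^2}E_2f_1(x^{(1)})f_2(x^{(2)}) = 0. \tag{3.8.29}$$

Add the last two equations:

$$\frac{1}{\mu}\left(\nabla_1^2f_1\right)(x^{(1)})f_2(x^{(2)}) + \frac{8\pi^2}{h^2}\left(E_1 + E_2 - \frac{q_1q_2}{\left|x^{(1)}\right|}\right)f_1(x^{(1)})f_2(x^{(2)}) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f_2\right)(x^{(2)})f_1(x^{(1)}) = 0. \tag{3.8.30}$$

Thus for $f \overset{\text{def}}{=} f_1 \otimes f_2$, $E \overset{\text{def}}{=} E_1 + E_2$, (3.8.21) holds:

$$\frac{1}{\mu}\left(\nabla_1^2f\right)(x) + \frac{8\pi^2}{h^2}\left(E - (V \circ \Phi^{-1})(x)\right)f(x) + \frac{1}{m_1 + m_2}\left(\nabla_2^2f\right)(x) = 0 \tag{3.8.31}$$

on $\Phi(U)$. Here $f : \mathbb{R}^6 \to \mathbb{C}$ is continuous with continuous first and second partial
derivatives on $\Phi(U)$.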

3.9 Some Brief Remarks on Axioms 

In the reference [84], J. von Neumann presents a rigorous mathematical founda- 
tion of quantum mechanics based on the spectral theory of Hermitian (= self- 
adjoint) operators on a separable Hilbert space. It is not our purpose to duplicate 
the rigors of that great treatise. We offer some brief, somewhat leisurely comments
on some of the basic axioms of the subject. Such comments complement some of 
the ideas of Chapter 1 and serve as background for some remarks and examples 
that will arise in Chapters 5, 6, 7, 9, and 10. Other works with sufficient rigor, of 
particular interest to mathematicians, are those of F. Berezin and M. Shubin [3], 
G. Mackey [62], E. Prugovecki [71], A. Sudbery [80], B. van der Waerden [83], 
and H. Weyl [87], for example. 

Observables of classical mechanics are the basic dynamical variables such as
position, momentum (linear and angular), energy, etc., or functions of such
variables. To the observable $p$ = linear momentum, for example, we have assigned
the self-adjoint operator $\hat{p} = \frac{\hbar}{i}\frac{d}{dx}$ (in a one-dimensional set-up), though we
have not been careful about specifying the (dense) domain of $\hat{p}$. For the case of
angular momentum, note the assignment given in (5.4.4). In general one of the
basic axioms of quantum mechanics is that to each classical observable $O$ there is
assigned a Hermitian operator $\hat{O}$ on a complex separable Hilbert space $H$. Here
$H$ is the state space of some given quantum system, a state of the system being
represented by some non-zero vector $\psi \in H$. Compare for example the sentence
following condition (iii) in Section 3.1. More precisely, and axiomatically, the
states of the quantum system are the one-dimensional subspaces of $H$; thus $\psi$
and $c\psi$, $c \in \mathbb{C}$, $c \neq 0$, represent the same state.

As $\hat{O}$ is self-adjoint its eigenvalues $\lambda$ are real. Another basic axiom is the
following:

The only values that a measurement of $O$ can yield are the
eigenvalues $\lambda$ of $\hat{O}$. (3.9.1)

If $\psi \in H - \{0\}$ is an eigenvector corresponding to the eigenvalue $\lambda$ of $\hat{O}$, then
$\psi$ describes the state of the system in which $O$ assumes the definite value $\lambda$.
For $\hat{O}$ equal to the quantized $z$-component of angular momentum, for example,
with eigenvalues $\lambda = \hbar m$, for certain magnetic quantum numbers $m$ (see Theorem
5.3 of Chapter 5), we see that by (3.9.1) a measurement of $O$ (which is the
$z$-component of the classical angular momentum of a particle) can yield only the
values $\hbar m$. Later we will take $|\hbar m| = \hbar|m|$ to be the magnitude of $O$.

We will need to consider the expectation value $\langle\hat{O}\rangle_{\psi}$ of $\hat{O}$ in the state $\psi$. One
takes as axiomatic the following definition

$$\langle\hat{O}\rangle_{\psi} \overset{\text{def}}{=} \frac{\langle\hat{O}\psi, \psi\rangle}{\langle\psi, \psi\rangle} \tag{3.9.2}$$

where $\langle\ ,\ \rangle$ denotes the inner product structure on $H$. See definition (6.2.7) for
example, where $\psi$ there is normalized: $\langle\psi, \psi\rangle = 1$. Suppose for example that
$\{\psi_n\}_{n=1}^{\infty}$ is a complete orthonormal basis of $H$ consisting of eigenvectors of $\hat{O}$:

$$\hat{O}\psi_n = \lambda_n\psi_n. \tag{3.9.3}$$

Then for $\psi \in H$ arbitrary,

$$\psi = \sum_{n=1}^{\infty}a_n\psi_n \quad \text{for } a_n \overset{\text{def}}{=} \langle\psi, \psi_n\rangle \tag{3.9.4}$$

where convergence is taken in the strong sense:

$$\|\psi - \text{the partial sum}\| \to 0 \quad \text{as } n \to \infty \tag{3.9.5}$$

$$\text{for } \|\psi\|^2 \overset{\text{def}}{=} \langle\psi, \psi\rangle = \sum_{n=1}^{\infty}|a_n|^2. \tag{3.9.6}$$

Using (3.9.3) and (3.9.4) we can express the numerator in (3.9.2) as follows:

$$\langle\hat{O}\psi, \psi\rangle = \sum_{n,m=1}^{\infty}\langle\hat{O}a_n\psi_n, a_m\psi_m\rangle = \sum_{n,m=1}^{\infty}a_n\bar{a}_m\lambda_n\delta_{n,m} = \sum_{n=1}^{\infty}\lambda_n|a_n|^2. \tag{3.9.7}$$

That is, by (3.9.6), (3.9.7)

$$\langle\hat{O}\rangle_{\psi} = \sum_{n=1}^{\infty}\lambda_nP_n \quad \text{for } P_n \overset{\text{def}}{=} \frac{|a_n|^2}{\sum_{k=1}^{\infty}|a_k|^2}; \text{ hence } \sum_{n=1}^{\infty}P_n = 1. \tag{3.9.8}$$

To understand the meaning of (3.9.8) consider for example a sequence of $n$
discrete values $\{x_1, \ldots, x_n\}$ where $x_j$ occurs with frequency $f_j$. Then

$$\bar{x} \overset{\text{def}}{=} \frac{\sum_{j=1}^{n}x_jf_j}{\sum_{j=1}^{n}f_j} \tag{3.9.9}$$

is the ordinary mean value of the sequence. Of course we can write

$$\bar{x} = \sum_{j=1}^{n}x_jp_j \quad \text{for } p_j \overset{\text{def}}{=} \frac{f_j}{\sum_{k=1}^{n}f_k}; \text{ hence } \sum_{j=1}^{n}p_j = 1. \tag{3.9.10}$$

Thus $p_j$ is the relative frequency with which the value $x_j$ occurs in the sequence.
Or $p_j$ is the probability that the value $x_j$ would be obtained by a random selection
from the list $\{x_1, \ldots, x_n\}$. Keeping (3.9.1) in mind, and comparing (3.9.8)
with (3.9.10), one is led to postulate that in (3.9.8), $P_n$ is the probability that a
measurement of the system in the state $\psi$ will yield precisely the value $\lambda_n$. Moreover,
equation (3.9.8) expresses $\langle\hat{O}\rangle_{\psi}$ as a weighted average of a large number of
measurements of the system. In particular if the state $\psi$ is normalized we obtain
a statistical meaning of the values $|a_n|^2 = P_n$. In Section 7.7 of Chapter 7 we will
have, for example, certain time-dependent coefficients $a_n(t)$ where the $|a_n(t)|^2$ will
represent certain transition probabilities.
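Formulas (3.9.7) and (3.9.8) amount to a weighted average, which the following sketch makes concrete. It assumes a state with only finitely many non-zero coefficients $a_n$ in an orthonormal eigenbasis; the eigenvalues and coefficients below are made-up sample data, and the function name is hypothetical.

```python
def expectation(eigenvalues, coefficients):
    """Return (<O>_psi, [P_n]) from (3.9.7), (3.9.8) for psi = sum a_n psi_n."""
    weights = [abs(a) ** 2 for a in coefficients]
    norm_squared = sum(weights)                    # ||psi||^2 by (3.9.6)
    probs = [w / norm_squared for w in weights]    # P_n of (3.9.8)
    value = sum(lam * p for lam, p in zip(eigenvalues, probs))
    return value, probs

value, probs = expectation([1.0, 3.0, 5.0], [1 + 1j, 0.5, -2j])
print(value, probs, sum(probs))
```

Rescaling all coefficients by a common non-zero constant $c$ leaves both the $P_n$ and the expectation value unchanged, consistent with $\psi$ and $c\psi$ representing the same state.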

It seems fitting at this point to mention also a fundamental result of Kato. 
The theory and applications of quantum mechanics proceeded for many years 
with no formal proof of the essential self-adjointness of the basic Hamiltonians 
$H = -\text{Laplacian} + V$ occurring in Schrödinger’s equation. This fundamental and
critical question for the mathematical foundations of the theory was certainly on 
J. von Neumann’s mind in the late 1920s and afterwards. Finally in 1951, Tosio 
Kato (1917-1999) established essential self-adjointness for atomic Hamiltonians 
(defined initially on a dense domain of smooth, compactly supported functions) — 
i.e., for potentials V given in (3.5.18); see references [22, 56, 57, 71]. Kato and 
others have extended his result to other types of Hamiltonians to the point that 
now the issue of self-adjointness is settled for all of the practical cases of interest 
in quantum mechanics. 




4 



Hypergeometric Equations 
and Special Functions 



4.1 Initial Remarks 

Many of the important partial differential equations of applied mathematics and 
theoretical physics are second order and are studied by the method of separation of 
variables which leads to a set of second-order ordinary differential equations. This 
is the case, for example, for equations of Laplace and Helmholtz and equations 
of Schrodinger, Dirac, and Klein-Gordon, say when one has a Coulomb type 
potential. The ordinary differential equations, arising by separation of variables, 
can be reduced in fact to equations of hypergeometric type — a type which we shall 
describe shortly. 

In the standard treatment in books on quantum mechanics these equations, arising from Schrödinger’s equation, say, are solved by the method of power series (in the few cases where solutions are possible). One thus obtains the wave functions and the desired energy levels of the quantum system. We shall present an entirely different approach, due to Arnold Nikiforov and Vasilii Uvarov [66], which is more uniform and elegant. For example, as will be seen later, energy levels will have the very general form

\[ \lambda_n = -n\frac{d\tau}{dx} - \frac{n(n-1)}{2}\frac{d^2\sigma}{dx^2} \tag{4.1.1} \]

for $n = 0, 1, 2, \dots$ where $\sigma$, $\tau$ are polynomial coefficients in a hypergeometric equation (arising from Schrödinger’s equation) of the form (4.2.2) below. As $\deg\sigma \le 2$, $\deg\tau \le 1$ the $\lambda_n$ indeed are scalars. Thus we begin with the study of special second-order ordinary differential equations, those of hypergeometric type, a study which provides in fact a uniform approach to the special functions of mathematical physics.



4.2 Differential Equations of Hypergeometric Type 

A differential equation of hypergeometric type is one of the form

\[ \sigma^2 u'' + \sigma\tilde{\tau} u' + \tilde{\sigma} u = 0 \tag{4.2.1} \]

on some domain of real or complex numbers, where $\sigma$, $\tilde{\sigma}$, $\tilde{\tau}$ are polynomials with $\deg\sigma, \deg\tilde{\sigma} \le 2$, $\deg\tilde{\tau} \le 1$. A special case of (4.2.1) is

\[ \sigma y'' + \tau y' + \lambda y = 0 \tag{4.2.2} \]

where $\sigma$, $\tau$ are polynomials, $\deg\sigma \le 2$, $\deg\tau \le 1$, and $\lambda \in \mathbb{R}$ or $\mathbb{C}$. Namely write (4.2.2) as $\sigma^2 y'' + \sigma\tau y' + \lambda\sigma y = 0$. Thus take $\tilde{\tau} = \tau$, $\tilde{\sigma} = \lambda\sigma$. We shall call (4.2.2) a canonical form of (4.2.1). Following A. Nikiforov and V. Uvarov [66] we show that (4.2.1) can be reduced to (4.2.2). First we consider some classical examples.



Classical Examples 



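\[ (1 - x^2)y''(x) - 2xy'(x) + \lambda y(x) = 0 \quad\text{on } \mathbb{R} \tag{4.2.3} \]

\[ y''(x) - 2xy'(x) + \lambda y(x) = 0 \quad\text{on } \mathbb{R} \tag{4.2.4} \]

\[ xy''(x) + (\alpha + 1 - x)y'(x) + \lambda y(x) = 0 \quad\text{on } \{x \in \mathbb{R} \mid x > 0\} \tag{4.2.5} \]

\[ (1 - x^2)y''(x) - xy'(x) + \lambda^2 y(x) = 0 \quad\text{on } (-1, 1) \tag{4.2.6} \]

\[ x^2y''(x) + xy'(x) + (x^2 - \lambda^2)y(x) = 0 \quad\text{on } \mathbb{R} - \{0\} \tag{4.2.7} \]

\[ x(x - 1)y''(x) + [(\alpha + \beta + 1)x - \gamma]y'(x) + \alpha\beta y(x) = 0 \quad\text{on } x > 1 \tag{4.2.8} \]

\[ \alpha x^2y''(x) + \beta xy'(x) + \gamma y(x) = 0 \quad\text{on } \mathbb{R}^+ \tag{4.2.9} \]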



are equations of Legendre, Hermite, Laguerre, Chebyshev, Bessel, Gauss and 
Euler, respectively. Except for equation (4.2.7) all are examples of equation (4.2.2). 
Equation (4.2.8) of course is the Gauss hypergeometric equation. 

We shall also consider Jacobi’s differential equation:

\[ (1 - x^2)y''(x) + [-(\alpha + \beta + 2)x + \beta - \alpha]y'(x) + \mu y(x) = 0 \quad\text{on } (-1, 1). \tag{4.2.10} \]

Some standard polynomials from advanced calculus are given as follows, in Rodrigues form. Let $n, m \ge 0$ be integers. The Legendre polynomials $P_n$ of degree $n$ are defined by

\[ P_n(x) = \frac{1}{2^n n!}\frac{d^n}{dx^n}(x^2 - 1)^n \tag{4.2.11} \]

for $n \ge 1$; we set $P_0(x) = 1$.

For example $P_1(x) = x$, $P_2(x) = \frac{3}{2}x^2 - \frac{1}{2}$, $P_3(x) = \frac{5}{2}x^3 - \frac{3}{2}x$. Similarly, the Laguerre polynomials $L_n^m$ of degree $n$ are defined by

\[ L_n^m(x) = \frac{e^x x^{-m}}{n!}\frac{d^n}{dx^n}\left(e^{-x}x^{n+m}\right) \tag{4.2.12} \]

for $x \ne 0$; we set $L_n^m(0) = \frac{(n+m)!}{n!\,m!}$. More generally we will replace $m$ by $\alpha \in \mathbb{R}$. For example $L_0^m(x) = 1$, $L_1^m(x) = 1 + m - x$, $L_2^m(x) = \frac{1}{2}(m + 1)(m + 2) - (m + 2)x + \frac{x^2}{2}$. The Hermite polynomials $H_n$ of degree $n$ are defined by

\[ H_n(x) = (-1)^n \exp(x^2)\frac{d^n}{dx^n}\exp(-x^2). \tag{4.2.13} \]

$H_0(x) = 1$, $H_1(x) = 2x$, $H_2(x) = 4x^2 - 2$. A generalization of the Legendre polynomials $P_n$ is given by the three-parameter family of Jacobi polynomials $P_n^{(\alpha,\beta)}$ of degree $n$ defined by

\[ P_n^{(\alpha,\beta)}(x) = \frac{(-1)^n}{2^n n!}(1 - x)^{-\alpha}(1 + x)^{-\beta}\frac{d^n}{dx^n}\left[(1 - x)^{\alpha+n}(1 + x)^{\beta+n}\right] \tag{4.2.14} \]

on $(-1, 1)$ for $\alpha, \beta \in \mathbb{R}$; $P_n^{(0,0)}(x) = P_n(x)$ on $(-1, 1)$. We shall prove that, for example, $P_n^{(\alpha,\beta)}$ is a solution of Jacobi’s differential equation (4.2.10) provided $n(\alpha + \beta + n + 1) = \mu$. Similarly $H_n$, $P_n$, $L_n^{\alpha}$ are solutions of equations (4.2.4), (4.2.3), (4.2.5) provided that $\lambda = 2n$, $n(n+1)$, $n$, respectively, there. Given $P_n$, the associated Legendre functions $P_n^m$ are defined by

\[ P_n^m(x) = (1 - x^2)^{m/2}\frac{d^m}{dx^m}P_n(x) \tag{4.2.15} \]

on $(-1, 1)$.
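The Rodrigues formula (4.2.11) can be evaluated mechanically. The sketch below represents a polynomial as a list of exact rational coefficients (lowest degree first), which is a representation chosen here for illustration, and reproduces the values of $P_2$ and $P_3$ quoted above.

```python
from fractions import Fraction
from math import factorial

def poly_mul(p, q):
    """Multiply polynomials given as coefficient lists (lowest degree first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_diff(p):
    """Differentiate a coefficient list once."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]

def legendre(n):
    """P_n from the Rodrigues formula (4.2.11): (1/(2^n n!)) d^n/dx^n (x^2 - 1)^n."""
    poly = [Fraction(1)]
    for _ in range(n):
        poly = poly_mul(poly, [Fraction(-1), Fraction(0), Fraction(1)])  # times (x^2 - 1)
    for _ in range(n):
        poly = poly_diff(poly)
    scale = Fraction(1, 2 ** n * factorial(n))
    return [scale * c for c in poly]

P2 = legendre(2)   # coefficients of (3/2)x^2 - 1/2
P3 = legendre(3)   # coefficients of (5/2)x^3 - (3/2)x
```

Exact fractions are used so that the comparison with the closed forms is an equality rather than a floating-point approximation.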



4.3 Reduction to Canonical Form 

Let $f : \mathbb{C} \to \mathbb{C}$ be a polynomial function of degree $\le 2$: $f(z) = az^2 + bz + c$, $a, b, c \in \mathbb{C}$. Let $\Delta := b^2 - 4ac$ be the discriminant of $f$. If $a \ne 0$ ($\deg f = 2$) then, completing the square, one writes $f(z) = a\left[\left(z + \frac{b}{2a}\right)^2 - \frac{\Delta}{4a^2}\right] = a\left(z + \frac{b}{2a}\right)^2 - \frac{\Delta}{4a}$. Choose a square root $a_0$ of $a$: $a_0^2 = a \implies f(z) = \left(a_0\left(z + \frac{b}{2a}\right)\right)^2 - \frac{\Delta}{4a}$. Thus if $\Delta = 0$, $f(z)$ has a square root $a_0 z + \frac{a_0 b}{2a}$ for $a \ne 0$; here $a_0 \ne 0$. If $a = 0$ and $\Delta = 0$, then $b^2 = 4ac \implies b = 0 \implies f(z) = c$. Thus if $c_0$ is a square root of $c$, the constant function $z \to c_0$ is a square root of $f$. This proves

Proposition 4.1. Let $f : \mathbb{C} \to \mathbb{C}$ be a polynomial function of degree $\le 2$, with discriminant $\Delta$: $f(z) = az^2 + bz + c$, $a, b, c \in \mathbb{C}$, $\Delta = b^2 - 4ac$. Suppose $\Delta = 0$. Then $f$ has a square root $p : \mathbb{C} \to \mathbb{C}$ which is a polynomial of degree $\le 1$: If $a \ne 0$ and $a_0^2 = a$, then $p(z) = a_0 z + \frac{a_0 b}{2a} = a_0 z + \frac{b}{2a_0} \implies$ degree $p = 1$; $a_0 \ne 0$. If $a = 0$ and $c_0^2 = c$, then $p(z) = c_0 \implies$ degree $p = 0$; here $a = 0$, $\Delta = 0 \implies b = 0 \implies f(z) = c$.
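Proposition 4.1 is constructive, and the construction is short enough to sketch in code. The function name and the tolerance parameter below are illustrative choices; `cmath.sqrt` picks one of the two square roots, matching the freedom in choosing $a_0$ (or $c_0$).

```python
import cmath

def sqrt_of_quadratic(a, b, c, tol=1e-12):
    """Return (p1, p0) with (p1*z + p0)^2 = a*z^2 + b*z + c, assuming b^2 - 4ac = 0."""
    if abs(b * b - 4 * a * c) > tol:
        raise ValueError("discriminant is not zero; no polynomial square root")
    if a != 0:
        a0 = cmath.sqrt(a)        # one of the two square roots of a
        return a0, b / (2 * a0)   # p(z) = a0*z + b/(2*a0)
    c0 = cmath.sqrt(c)            # here b = 0 and f is the constant c
    return 0j, c0

# Real example: 4z^2 + 12z + 9 = (2z + 3)^2.
p1, p0 = sqrt_of_quadratic(4, 12, 9)

# Complex example: -z^2 + 2iz + 1 = (iz + 1)^2.
q1, q0 = sqrt_of_quadratic(-1, 2j, 1)
```

The other square root of $f$ is obtained by negating both returned coefficients.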

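Given polynomial functions $\sigma, \tilde{\sigma}, \tilde{\tau} : \mathbb{C} \to \mathbb{C}$ with degree $\sigma, \tilde{\sigma} \le 2$, degree $\tilde{\tau} \le 1$, and given $\kappa \in \mathbb{C}$, define $f_\kappa : \mathbb{C} \to \mathbb{C}$ by $f_\kappa := \frac{(\tilde{\tau} - \sigma')^2}{4} + \kappa\sigma - \tilde{\sigma}$. Thus $f_\kappa$ is a polynomial function of degree $\le 2$. More explicitly, write $\sigma(z) = a_0 + a_1 z + a_2 z^2$, $\tilde{\sigma}(z) = b_0 + b_1 z + b_2 z^2$, $\tilde{\tau}(z) = c_0 + c_1 z$; $\sigma'(z) = a_1 + 2a_2 z \implies (\tilde{\tau}(z) - \sigma'(z))^2 = (c_0 - a_1 + (c_1 - 2a_2)z)^2 = (c_0 - a_1)^2 + 2(c_0 - a_1)(c_1 - 2a_2)z + (c_1 - 2a_2)^2 z^2 \implies$

\[ f_\kappa(z) = \left[\frac{(c_0 - a_1)^2}{4} + \kappa a_0 - b_0\right] + \left[\frac{(c_0 - a_1)(c_1 - 2a_2)}{2} + \kappa a_1 - b_1\right]z + \left[\frac{(c_1 - 2a_2)^2}{4} + \kappa a_2 - b_2\right]z^2. \]

Therefore if $\Delta(\kappa)$ is the discriminant of $f_\kappa$, the equation $\Delta(\kappa) = 0$ is a quadratic equation in $\kappa$ (which might in some special case reduce to a linear equation in $\kappa$). Assume $\Delta(\kappa) = 0$. Then, by Proposition 4.1, we know that we can find a polynomial function $p$ such that $p^2 = f_\kappa$, degree $p \le 1$: If $f_\kappa(z) = c + bz + az^2$ with $a \ne 0$ we can take $p(z) = a_0 z + \frac{b}{2a_0}$, where $a_0^2 = a$, $a_0 \ne 0$; if $a = 0$, then $\Delta(\kappa) = 0 \implies b = 0 \implies f_\kappa(z) = c$, and we take $p(z) = c_0$, where $c_0^2 = c$.

Let $\pi^{\pm} := \frac{\sigma' - \tilde{\tau}}{2} \pm p$. Then degree $\pi^{\pm} \le 1$ and

\[ (\pi^{\pm})^2 + (\tilde{\tau} - \sigma')\pi^{\pm} = \frac{(\sigma' - \tilde{\tau})^2}{4} \pm (\sigma' - \tilde{\tau})p + p^2 + (\tilde{\tau} - \sigma')\frac{(\sigma' - \tilde{\tau})}{2} \pm (\tilde{\tau} - \sigma')p = p^2 - \frac{(\sigma' - \tilde{\tau})^2}{4} = \kappa\sigma - \tilde{\sigma}, \]

which proves

Proposition 4.2. Let $\kappa \in \mathbb{C}$ such that $\Delta(\kappa) = 0$ where $\Delta(\kappa)$ is the discriminant of $f_\kappa := \frac{(\tilde{\tau} - \sigma')^2}{4} + \kappa\sigma - \tilde{\sigma}$. Thus, by Proposition 4.1, we can choose a polynomial $p$ of degree $\le 1$ such that $p^2 = f_\kappa$. Let $\pi^{\pm} := \frac{\sigma' - \tilde{\tau}}{2} \pm p$. Then degree $\pi^{\pm} \le 1$ and

\[ (\pi^{\pm})^2 + (\tilde{\tau} - \sigma')\pi^{\pm} + \tilde{\sigma} - \kappa\sigma = 0. \]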

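Again let $\kappa \in \mathbb{C}$ be a solution of $\Delta(\kappa) = 0$, where $\Delta(\kappa)$ is the discriminant of $f_\kappa$. Then we can find a solution $\pi$ of $\pi^2 + (\tilde{\tau} - \sigma')\pi + \tilde{\sigma} - \kappa\sigma = 0$, $\pi(z) = \alpha + \beta z$, $\alpha, \beta \in \mathbb{C}$; for example, we have solutions $\pi = \pi^{\pm}$ by Proposition 4.2. Set $\lambda := \kappa + \pi' \in \mathbb{C}$ and set $\tau := \tilde{\tau} + 2\pi$; then degree $\tau \le 1$. Also, on a domain $D$ on which $\sigma$ is non-vanishing, let $\phi$ be a solution of $\phi' = \phi\pi/\sigma$: for example $\phi(z) = \exp\left(\int\frac{\pi(z)}{\sigma(z)}\,dz\right)$ on $D$. Then

Proposition 4.3.

\[ \sigma^2\phi'' + \sigma\tilde{\tau}\phi' + \tilde{\sigma}\phi = \lambda\sigma\phi \quad\text{on } D \tag{4.3.1} \]

and

\[ 2\sigma^2\phi' + \sigma\tilde{\tau}\phi = \sigma\phi\tau \quad\text{on } D. \]

Proof. We compute on $D$: $\frac{\phi'}{\phi} = \frac{\pi}{\sigma} \implies \left(\frac{\phi'}{\phi}\right)' = \left(\frac{\pi}{\sigma}\right)' \implies \frac{\phi''}{\phi} - \left(\frac{\phi'}{\phi}\right)^2 = \frac{\sigma\pi' - \pi\sigma'}{\sigma^2} \implies \sigma^2\frac{\phi''}{\phi} = \pi^2 + \sigma\pi' - \pi\sigma'$. Since $\pi^2 = -(\tilde{\tau} - \sigma')\pi - \tilde{\sigma} + \kappa\sigma$, this gives $\sigma^2\frac{\phi''}{\phi} = -\tilde{\tau}\pi + (\kappa + \pi')\sigma - \tilde{\sigma} = \lambda\sigma - \tilde{\sigma} - \tilde{\tau}\pi \implies \sigma^2\phi'' = \lambda\sigma\phi - \tilde{\sigma}\phi - \phi\tilde{\tau}\pi = \lambda\sigma\phi - \tilde{\sigma}\phi - \tilde{\tau}\sigma\phi' \implies \sigma^2\phi'' + \sigma\tilde{\tau}\phi' + \tilde{\sigma}\phi = \lambda\sigma\phi$, as claimed. Also $2\sigma^2\phi' + \sigma\tilde{\tau}\phi = 2\sigma^2\frac{\phi\pi}{\sigma} + \sigma\tilde{\tau}\phi = 2\sigma\phi\pi + \sigma\tilde{\tau}\phi = \sigma\phi(2\pi + \tilde{\tau}) = \sigma\phi\tau$. □

Let $u$, $y$ be functions related by $u = \phi y$ on $D$. Then $u' = \phi y' + \phi' y$, $u'' = \phi y'' + y'\phi' + y\phi'' + \phi' y' = \phi y'' + 2\phi' y' + y\phi'' \implies \sigma^2 u'' + \sigma\tilde{\tau}u' + \tilde{\sigma}u = \sigma^2\phi y'' + 2\sigma^2\phi' y' + \sigma^2 y\phi'' + \sigma\tilde{\tau}\phi y' + \sigma\tilde{\tau}\phi' y + \tilde{\sigma}\phi y = \sigma^2\phi y'' + (2\sigma^2\phi' + \sigma\tilde{\tau}\phi)y' + (\sigma^2\phi'' + \sigma\tilde{\tau}\phi' + \tilde{\sigma}\phi)y = \sigma^2\phi y'' + (\sigma\phi\tau)y' + \lambda\sigma\phi y$ (by Proposition 4.3) $= \sigma\phi(\sigma y'' + \tau y' + \lambda y)$. Since $\sigma\phi$ never vanishes on $D$ we have thus proved

Theorem 4.1 (Reduction to canonical form). Let $D$ be a domain on which $\sigma$ is non-vanishing. Then on $D$,

\[ \sigma^2 u'' + \sigma\tilde{\tau}u' + \tilde{\sigma}u = 0 \iff \sigma y'' + \tau y' + \lambda y = 0, \tag{4.3.2} \]

for $u$, $y$ related by $u = \phi y$.

Equivalently

\[ u'' + \frac{\tilde{\tau}}{\sigma}u' + \frac{\tilde{\sigma}}{\sigma^2}u = 0 \iff \sigma y'' + \tau y' + \lambda y = 0. \tag{4.3.3} \]

Proposition 4.4. On the domain $D$ on which $\sigma$ is non-vanishing, given a solution $\phi$ of $\phi' = \phi\pi/\sigma$ and a solution $h$ of $h' = \tilde{\tau}/\sigma$, set $\rho := \phi^2 e^h/\sigma$. Then $\rho$ is a solution of $(\rho\sigma)' = \rho\tau$ on $D$. Recall $\tau := \tilde{\tau} + 2\pi$.

Proof. $(\rho\sigma)' = (\phi^2 e^h)' = \phi^2 h' e^h + 2\phi\phi' e^h = \rho\tilde{\tau} + 2\pi\rho = \rho(\tilde{\tau} + 2\pi) = \rho\tau$. □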



In applying Theorem 4.1 the reader obviously must be clear about the notation and definitions established. For convenience we summarize the reduction procedure, reiterating how one obtains $\tau$ and $\lambda$ in (4.3.2) (the second equation there being equation (4.2.2) of course). We are given the coefficients $\sigma$, $\tilde{\sigma}$, $\tilde{\tau}$ in (4.2.1). Given $\kappa \in \mathbb{C}$ one chooses a square root $p$ of the polynomial

\[ f_\kappa := \frac{(\tilde{\tau} - \sigma')^2}{4} + \kappa\sigma - \tilde{\sigma}, \tag{4.3.4} \]

assuming that the discriminant $\Delta(\kappa)$ of $f_\kappa$ vanishes; see Proposition 4.1. Let

\[ \pi^{\pm} := \frac{\sigma' - \tilde{\tau}}{2} \pm p \tag{4.3.5} \]

and let $\pi$ be one of the choices $\pi^{+}$ or $\pi^{-}$. Then

\[ \tau := \tilde{\tau} + 2\pi, \qquad \lambda := \kappa + \pi', \tag{4.3.6} \]

with degree $\pi, \tau \le 1$, $\lambda \in \mathbb{C}$. Also the solutions $u$, $y$ in (4.3.2) (i.e., in (4.2.1) and (4.2.2)) are related by $u = \phi y$ where $\phi$ satisfies

\[ \frac{\phi'}{\phi} = \frac{\pi}{\sigma} \tag{4.3.7} \]

on a domain $D$ on which $\sigma$ is non-vanishing. For example one can take

\[ \phi = \exp\left(\int\frac{\pi(z)}{\sigma(z)}\,dz\right). \tag{4.3.8} \]
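The first step of the procedure, forming $f_\kappa$ in (4.3.4) and locating the values of $\kappa$ with $\Delta(\kappa) = 0$, is pure coefficient arithmetic. The following sketch encodes polynomials as coefficient tuples (lowest degree first); the function names and the sample coefficients are illustrative assumptions, not notation from the text.

```python
from fractions import Fraction

def f_kappa(sigma, sigma_t, tau_t, kappa):
    """Coefficients (c, b, a) of f_kappa = (tau_t - sigma')^2/4 + kappa*sigma - sigma_t.

    sigma and sigma_t are (s0, s1, s2) for s0 + s1*z + s2*z^2; tau_t is (t0, t1).
    """
    a0, a1, a2 = sigma
    b0, b1, b2 = sigma_t
    d0, d1 = tau_t[0] - a1, tau_t[1] - 2 * a2   # tau_t - sigma' = d0 + d1*z
    c = Fraction(d0 * d0) / 4 + kappa * a0 - b0
    b = Fraction(d0 * d1) / 2 + kappa * a1 - b1
    a = Fraction(d1 * d1) / 4 + kappa * a2 - b2
    return c, b, a

def discriminant(sigma, sigma_t, tau_t, kappa):
    """Delta(kappa) = b^2 - 4ac for f_kappa(z) = c + b*z + a*z^2."""
    c, b, a = f_kappa(sigma, sigma_t, tau_t, kappa)
    return b * b - 4 * a * c

# Hypothetical data: sigma = 1, tau_t = 0, sigma_t = 3 - z^2, so f_kappa = kappa - 3 + z^2.
sigma, sigma_t, tau_t = (1, 0, 0), (3, 0, -1), (0, 0)
delta_at_3 = discriminant(sigma, sigma_t, tau_t, 3)   # vanishes: kappa = 3 is admissible
delta_at_5 = discriminant(sigma, sigma_t, tau_t, 5)   # nonzero: kappa = 5 is not
```

Once an admissible $\kappa$ is found, Proposition 4.1 supplies $p$, and (4.3.5) and (4.3.6) give $\pi$, $\tau$ and $\lambda$.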



The reduction to canonical form involves certain choices of signs. For example, $f_\kappa$ (with $\Delta(\kappa) = 0$) has two square roots: $\pm p$. There is a more or less standard way of making the choice of signs. In some basic examples from mathematical physics one has in fact that $\tau' < 0$ and $\tau$ has a root on some distinguished interval from $a$ to $b$ (possibly $a = -\infty$, $b = \infty$) on which $\sigma y'' + \tau y' + \lambda y = 0$ is defined. Thus we will generally choose our signs so that $\tau' < 0$ and such that $\tau$ has a root in $(a, b)$. Further justification for this is given by the following lemma.

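Lemma 4.1. Let $\tau(x)$ be a real-valued polynomial function of $x$ of degree $\le 1$. Let $\sigma$, $\rho$ be real-valued continuously differentiable functions on a closed interval $[a, b]$ ($a, b \in \mathbb{R}$, $a < b$) such that $(\rho\sigma)'(x) = (\rho\tau)(x)$ with $\rho(x), \sigma(x) > 0$ on $(a, b)$. Suppose also that $\rho(x)\sigma(x)x^k = 0$ at $x = a, b$, for $k = 0, 1$. Then $\tau' < 0$. In particular $\tau$ is non-constant; i.e., $\deg\tau = 1$. Also $\tau$ has a root in $(a, b)$.

Proof. On $(a, b)$, $(\rho\tau^2)(x) = ((\rho\tau)\tau)(x) = ((\rho\sigma)'\tau)(x)$. Therefore $(\rho\tau^2)(x) = ((\rho\sigma)'\tau)(x)$ on $[a, b]$, by continuity (i.e., $(\rho\sigma)'(x) = (\rho\tau)(x)$ on $[a, b]$). Here $\tau$, $(\rho\sigma)$ are differentiable on $[a, b]$ with $\tau'$, $(\rho\sigma)'$ integrable there. Therefore integrate by parts:

\[ \int_a^b(\rho\tau^2)(x)\,dx = \int_a^b[\tau(\rho\sigma)'](x)\,dx = \tau\rho\sigma(x)\Big]_a^b - \int_a^b\tau'(\rho\sigma)(x)\,dx = -\tau'\int_a^b(\rho\sigma)(x)\,dx \]

by the boundary hypothesis $\rho\sigma x^k = 0$ at $x = a, b$ for $k = 0, 1$ (recall that $\tau$ has degree $\le 1$). We note that $\tau \not\equiv 0$ on $(a, b)$. Otherwise $(\rho\sigma)'(x) = (\tau\rho)(x) = 0$ on $(a, b) \implies (\rho\sigma)(x) = c$ is a constant on $(a, b)$. By continuity $(\rho\sigma)(x) = c$ on $[a, b]$. By the boundary hypothesis with $k = 0$, $c = (\rho\sigma)(a) = 0 \implies (\rho\sigma)(x) = 0$ on $(a, b)$, where $\rho(x), \sigma(x) > 0$. As this is not possible we must have $\tau \not\equiv 0$ on $(a, b)$: $\tau^2(x_0) > 0$ for some $x_0$, $a < x_0 < b$. Then $\int_a^b(\rho\tau^2)(x)\,dx > 0$, as $\rho\tau^2 \ge 0$ on $[a, b]$. Also $\int_a^b(\rho\sigma)(x)\,dx > 0$ [which also follows by the mean-value theorem for integrals]. Therefore

\[ \tau' = \frac{-\int_a^b(\rho\tau^2)(x)\,dx}{\int_a^b(\rho\sigma)(x)\,dx} < 0. \tag{4.3.9} \]

We can write (4.3.9) of course as

\[ \int_a^b(\rho\tau^2)(x)\,dx = -\tau'\int_a^b(\rho\sigma)(x)\,dx. \tag{4.3.10} \]

Also, as $\rho\sigma$ is continuous on $[a, b]$ and differentiable on $(a, b)$, and as $(\rho\sigma)(a) = (\rho\sigma)(b)\ (= 0)$ by the boundary hypothesis with $k = 0$, we can apply Rolle’s theorem: there exists a point $x_0$ in $(a, b)$ for which $(\rho\sigma)'(x_0) = 0$; i.e., $\rho(x_0)\tau(x_0) = 0 \implies \tau(x_0) = 0$. That is, $\tau$ has a zero in $(a, b)$. □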

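Remarks. Suppose in the preceding arguments we start with a polynomial function $f : \mathbb{R} \to \mathbb{R}$ of degree $\le 2$ with real coefficients: $f(x) = ax^2 + bx + c$, $a, b, c \in \mathbb{R}$. If $a, c \ge 0$ the above arguments provide for a real square root $p$ of $f$ provided $\Delta = b^2 - 4ac = 0$: $p(x) = a_0 x + \frac{b}{2a_0}$ for $a_0 = \sqrt{a} \ne 0$ if $a > 0$; $p(x) = \sqrt{c}$ if $a = 0$. Thus suppose now we are given polynomial functions $\sigma, \tilde{\sigma}, \tilde{\tau} : \mathbb{R} \to \mathbb{R}$ with real coefficients such that again degree $\sigma, \tilde{\sigma} \le 2$, degree $\tilde{\tau} \le 1$.

If $\kappa \in \mathbb{R}$, then $f_\kappa := \frac{(\tilde{\tau} - \sigma')^2}{4} + \kappa\sigma - \tilde{\sigma}$ has real coefficients; $\deg f_\kappa \le 2$. Write $f_\kappa(x) = ax^2 + bx + c$, $a, b, c \in \mathbb{R}$, and assume $\Delta(\kappa)\ (= b^2 - 4ac) = 0$ for $\kappa \in \mathbb{R}$. If $a, c \ge 0$, then $f_\kappa$ has a real square root $p$: $p(x) = \sqrt{a}\,x + \frac{b}{2\sqrt{a}}$ if $a > 0$, $p(x) = \sqrt{c}$ if $a = 0$. Then $\pi^{\pm} := \frac{\sigma' - \tilde{\tau}}{2} \pm p$ has real coefficients. Also $\lambda := \kappa + \pi' \in \mathbb{R}$, and $\pi = \pi^{\pm}$ and $\tau := \tilde{\tau} + 2\pi$ have real coefficients. Choose $\phi$ = a real solution on $D$ of $\phi' = \phi\pi/\sigma$, where $D$ = a domain on which $\sigma$ is non-vanishing. For $h$ a real solution of $h' = \tilde{\tau}/\sigma$, $\rho := \phi^2 e^h/\sigma$ is real: $(\rho\sigma)' = \rho\tau$ on $D$, as in Proposition 4.4.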



4.4 Solutions of $\sigma y'' + \tau y' + \lambda y = 0$

Given the reduction of the general hypergeometric equation (4.2.1) to a canonical 
form (4.2.2), provided by Theorem 4.1, we focus a bit on solutions of (4.2.2) — 
in particular on polynomial solutions. In a unified, uniform manner we obtain
the classical orthogonal polynomial solutions to equations of Hermite, Legendre,
Laguerre, Jacobi, and others, for example, as well as recursion properties of
these solutions; cf. equations (4.2.3) through (4.2.10). These solutions to (4.2.2)
incidentally will be given in a generalized Rodrigues form; see (4.4.12) below.

We provide several examples of the reduction technique of Theorem 4.1 to ob- 
tain results for later applications, and we compute the wave functions and energy 
levels of the quantized harmonic oscillator, verifying some assertions made in 
Chapter 3. 

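Let $p, q : \mathbb{R} \to \mathbb{C}$ be functions of the form $p(x) = a + bx$, $q(x) = \alpha + \beta x + \gamma x^2$ ($a, b, \alpha, \beta, \gamma \in \mathbb{C}$; possibly $b, \gamma = 0$) and let $f : U \subset \mathbb{R} \to \mathbb{C}$ be $n$ times differentiable, $n \ge 2$. Then by Leibniz’ rule

\[ \frac{d^n}{dx^n}(pf) = \sum_{k=0}^{n}\binom{n}{k}\frac{d^k p}{dx^k}\frac{d^{n-k}f}{dx^{n-k}} = \binom{n}{0}p\frac{d^n f}{dx^n} + \binom{n}{1}p'\frac{d^{n-1}f}{dx^{n-1}} = p\frac{d^n f}{dx^n} + np'\frac{d^{n-1}f}{dx^{n-1}} \tag{L1} \]

on $U$, since $\frac{d^k p}{dx^k} = 0$ for $k \ge 2$. Similarly

\[ \frac{d^n}{dx^n}(qf) = \binom{n}{0}q\frac{d^n f}{dx^n} + \binom{n}{1}q'\frac{d^{n-1}f}{dx^{n-1}} + \binom{n}{2}q''\frac{d^{n-2}f}{dx^{n-2}} = q\frac{d^n f}{dx^n} + nq'\frac{d^{n-1}f}{dx^{n-1}} + \frac{n(n-1)}{2}q''\frac{d^{n-2}f}{dx^{n-2}} \tag{L2} \]

on $U$. Thus we have equations (L1), (L2) on $U$ for polynomials $p$, $q$ with $\deg p \le 1$, $\deg q \le 2$.

We consider the differential equation

\[ \sigma(x)y''(x) + \tau(x)y'(x) + \lambda y(x) = 0 \tag{4.4.1} \]

on some open set $U \subset \mathbb{R}$, an equation of hypergeometric type: $\sigma(x)$ is a polynomial in $x$ of degree $\le 2$ and $\tau(x)$ is a polynomial in $x$ of degree $\le 1$; see equation (4.2.2). We suppose we have a function $\rho$ on $U$ which never vanishes and which satisfies the equation

\[ (\sigma\rho)' = \tau\rho \quad\text{on } U. \tag{4.4.2} \]

Our goal is to construct (using $\rho$) a family of particular (polynomial) solutions $y_n$, $n = 0, 1, 2, 3, 4, \dots$, of equation (4.4.1), for $\lambda = \lambda_n := -n\tau' - \frac{n(n-1)}{2}\sigma''$. In fact set $f_n := \sigma^n\rho$, $y_n := \frac{1}{\rho}f_n^{(n)}$ on $U$. For $n = 0$, $f_0 = \rho$, $y_0 = \frac{1}{\rho}\rho = 1 \implies y_0$ satisfies equation (4.4.1) with $\lambda = \lambda_0 = 0$. For $n = 1$, $\lambda_1 = -\tau'$, $f_1 = \sigma\rho$, $y_1 = \frac{1}{\rho}(\sigma\rho)' = \frac{\tau\rho}{\rho}$ (by equation (4.4.2)) $= \tau$. Thus $\sigma y_1'' + \tau y_1' + \lambda_1 y_1 = \sigma\tau'' + \tau\tau' + (-\tau')\tau = 0$ as $\tau'' = 0$ (for $\deg\tau \le 1$) $\implies y_1$ satisfies equation (4.4.1) for $\lambda = \lambda_1$.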
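Identities (L1) and (L2) are easy to spot-check when $f$ is itself a polynomial. The sketch below verifies (L2) for one hypothetical choice of $q$, $f$ and $n$, with polynomials stored as integer coefficient lists (lowest degree first); the helper names are illustrative.

```python
def trim(p):
    """Drop trailing zero coefficients (keep at least one entry)."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim([a + b for a, b in zip(p, q)])

def mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def diff(p, k=1):
    """k-th derivative of a coefficient list."""
    p = list(p)
    for _ in range(k):
        p = [i * c for i, c in enumerate(p)][1:] or [0]
    return trim(p)

def leibniz_L2_rhs(q, f, n):
    """q f^(n) + n q' f^(n-1) + n(n-1)/2 q'' f^(n-2), the right side of (L2)."""
    term0 = mul(q, diff(f, n))
    term1 = mul([n], mul(diff(q, 1), diff(f, n - 1)))
    term2 = mul([n * (n - 1) // 2], mul(diff(q, 2), diff(f, n - 2)))
    return add(add(term0, term1), term2)

q = [1, 2, 3]             # hypothetical q(x) = 1 + 2x + 3x^2
f = [2, -1, 0, 0, 0, 1]   # hypothetical f(x) = 2 - x + x^5
n = 3
lhs = diff(mul(q, f), n)
rhs = leibniz_L2_rhs(q, f, n)
```

The sum in Leibniz' rule terminates after three terms precisely because $q''' = 0$, which is the only property of $q$ that (L2) uses.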

Theorem 4.2. $y_n$ is a solution of equation (4.4.1) for $\lambda = \lambda_n$, $n = 0, 1, 2, 3, \dots$

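Proof. For the proof we may assume $n \ge 2$, the cases $n = 0, 1$ having already been checked. Define

\[ \tau_n := \tau + n\sigma' \tag{4.4.3} \]

so that $\tau_n(x)$ is a polynomial in $x$ of degree $\le 1$.

Then $f_n = \sigma^n\rho = \sigma^{n-1}\,\sigma\rho \implies$

\begin{align*}
f_n' &= \sigma^{n-1}(\sigma\rho)' + (n-1)\sigma^{n-2}\sigma'\,\sigma\rho \\
&= \sigma^{n-1}\tau\rho + (n-1)\sigma^{n-1}\rho\sigma' \quad\text{by definition (4.4.2)} \\
&= \tau f_{n-1} + (n-1)\sigma' f_{n-1} = \tau_{n-1}f_{n-1} \quad\text{by definition (4.4.3)}, \tag{4.4.4}
\end{align*}

so that

\begin{align*}
y_n &:= \frac{1}{\rho}\frac{d^n f_n}{dx^n} = \frac{1}{\rho}\frac{d^{n-1}}{dx^{n-1}}f_n' = \frac{1}{\rho}\frac{d^{n-1}}{dx^{n-1}}\left(\tau_{n-1}f_{n-1}\right) \\
&= \frac{1}{\rho}\left[\tau_{n-1}\frac{d^{n-1}}{dx^{n-1}}f_{n-1} + (n-1)\tau_{n-1}'\frac{d^{n-2}}{dx^{n-2}}f_{n-1}\right] \quad\text{by (L1) (since } \deg\tau_{n-1} \le 1) \\
&= \frac{1}{\rho}\tau_{n-1}\rho y_{n-1} + \frac{n-1}{\rho}\tau_{n-1}'\frac{d^{n-2}}{dx^{n-2}}f_{n-1} \\
&= \tau_{n-1}y_{n-1} + (n-1)\tau_{n-1}'\frac{1}{\rho}\frac{d^{n-2}}{dx^{n-2}}f_{n-1}. \tag{4.4.5}
\end{align*}

Similarly write

\begin{align*}
y_n &= \frac{1}{\rho}\frac{d^n f_n}{dx^n} = \frac{1}{\rho}\frac{d^n}{dx^n}\left(\sigma f_{n-1}\right) \\
&= \frac{1}{\rho}\left[\sigma\frac{d^n f_{n-1}}{dx^n} + n\sigma'\frac{d^{n-1}f_{n-1}}{dx^{n-1}} + \frac{n(n-1)}{2}\sigma''\frac{d^{n-2}f_{n-1}}{dx^{n-2}}\right] \quad\text{by (L2) (since } \deg\sigma \le 2), \tag{4.4.6}
\end{align*}

where

\[ \frac{d^n f_{n-1}}{dx^n} = \frac{d}{dx}\frac{d^{n-1}f_{n-1}}{dx^{n-1}} = \frac{d}{dx}\left(\rho y_{n-1}\right) = \rho y_{n-1}' + \rho' y_{n-1}, \]

so that

\[ y_n = \sigma y_{n-1}' + \frac{\sigma\rho'}{\rho}y_{n-1} + n\sigma' y_{n-1} + \frac{n(n-1)}{2}\sigma''\frac{1}{\rho}\frac{d^{n-2}f_{n-1}}{dx^{n-2}}, \]

and where by equation (4.4.2)

\[ \frac{\sigma\rho'}{\rho} = \frac{\sigma\rho' + \sigma'\rho - \sigma'\rho}{\rho} = \frac{\tau\rho - \sigma'\rho}{\rho} = \tau - \sigma'. \]

That is,

\[ y_n = \sigma y_{n-1}' + [\tau + (n-1)\sigma']y_{n-1} + \frac{n(n-1)}{2}\sigma''\frac{1}{\rho}\frac{d^{n-2}f_{n-1}}{dx^{n-2}}. \tag{4.4.7} \]

On the other hand,

\[ \lambda_{n-1} + (n-1)\tau_{n-1}' = -(n-1)\tau' - \frac{(n-1)(n-2)}{2}\sigma'' + (n-1)\left[\tau' + (n-1)\sigma''\right] = \frac{n(n-1)}{2}\sigma'', \tag{4.4.8} \]

so that (by (4.4.3) and (4.4.5))

\begin{align*}
y_n &= \sigma y_{n-1}' + [\tau + (n-1)\sigma']y_{n-1} + (n-1)\tau_{n-1}'\frac{1}{\rho}\frac{d^{n-2}}{dx^{n-2}}f_{n-1} + \lambda_{n-1}\frac{1}{\rho}\frac{d^{n-2}}{dx^{n-2}}f_{n-1} \\
&= \sigma y_{n-1}' + y_n + \lambda_{n-1}\frac{1}{\rho}\frac{d^{n-2}}{dx^{n-2}}f_{n-1},
\end{align*}

so that

\[ \sigma y_{n-1}' = -\lambda_{n-1}\frac{1}{\rho}\frac{d^{n-2}}{dx^{n-2}}\left(\sigma^{n-1}\rho\right) \tag{4.4.9} \]

for $n \ge 2$; i.e.,

\[ \sigma\rho y_n' = -\lambda_n\frac{d^{n-1}}{dx^{n-1}}\left(\sigma^n\rho\right) \tag{4.4.10} \]

for $n \ge 1$, which we differentiate to obtain (by (4.4.2))

\[ \sigma\rho y_n'' + y_n'\tau\rho = (\sigma\rho y_n')' = -\lambda_n\frac{d^n}{dx^n}\left(\sigma^n\rho\right) = -\lambda_n\rho y_n, \quad\text{i.e.,} \quad \sigma y_n'' + \tau y_n' + \lambda_n y_n = 0, \tag{4.4.11} \]

which proves the theorem. □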



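We reiterate that for $\rho$ such that $(\sigma\rho)' = \tau\rho$, $n = 0, 1, 2, 3, \dots$

\[ \lambda_n := -n\tau' - \frac{n(n-1)}{2}\sigma'', \qquad y_n := \frac{1}{\rho}\frac{d^n}{dx^n}\left(\sigma^n\rho\right). \tag{4.4.12} \]

Note also that by equation (4.4.5), $y_{n+1} = \tau_n y_n + n\tau_n'\frac{1}{\rho}\frac{d^{n-1}}{dx^{n-1}}f_n$ so that by equation (4.4.10), $\lambda_n y_{n+1} = \lambda_n\tau_n y_n + n\tau_n'\frac{1}{\rho}\left(-\sigma\rho y_n'\right) \implies$

Corollary 4.1.

\[ \lambda_n y_{n+1} = \lambda_n\tau_n y_n - n\sigma\tau_n' y_n' \quad\text{for } n \ge 1. \]

Here both sides are zero for $n = 0$. Also (by equation (4.4.10))

\[ \sigma\rho y_n' = -\lambda_n\frac{d^{n-1}}{dx^{n-1}}\left(\sigma^n\rho\right) \quad\text{for } n \ge 1. \]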

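Proposition 4.5. $y_n$ is a polynomial on $U$ of degree $\le n$.

Proof. $y_0 = \frac{1}{\rho}\rho = 1$, $y_1 = \frac{1}{\rho}(\sigma\rho)' = \frac{1}{\rho}\tau\rho = \tau \implies$ the proposition is true for $n = 0, 1$. To finish the proof we first observe

Lemma 4.2. Let $k \ge 1$ be an integer and let $r$ be a differentiable function on $U$. Let $f_{k,r} := \sigma^k\rho r$. Then $f_{k,r}' = \sigma^{k-1}\rho[(k-1)\sigma' r + \tau r + \sigma r']$. In particular if $r$ is a polynomial on $U$ of degree $\le l$, then $(k-1)\sigma' r + \tau r + \sigma r'$ is a polynomial on $U$ of degree $\le l + 1$.

Assume the lemma for now. Then for $f_n = \sigma^n\rho = f_{n,1}$ (i.e., take $r = 1$ in the lemma), $f_n' = \sigma^{n-1}\rho r_1$ for $n \ge 1$, where $r_1 = (n-1)\sigma' + \tau$ is a polynomial of degree $\le 1$. For $n \ge 2$ (i.e., $n - 1 \ge 1$) apply the lemma to $f_n' = \sigma^{n-1}\rho r_1 = f_{n-1,r_1}$: $f_n'' = \sigma^{n-2}\rho r_2$, where $r_2$ is a polynomial of degree $\le 2$. For $n \ge 3$ (i.e., $n - 2 \ge 1$) apply the lemma to $f_n'' = \sigma^{n-2}\rho r_2 = f_{n-2,r_2}$: $f_n''' = \sigma^{n-3}\rho r_3$, where $r_3$ is a polynomial of degree $\le 3$. Similarly for $n \ge p$, $f_n^{(p)} = \sigma^{n-p}\rho r_p$, where $r_p$ is a polynomial of degree $\le p$. Therefore $f_n^{(n)} = \sigma^{n-n}\rho r_n = \rho r_n$, where $r_n$ is a polynomial of degree $\le n \implies y_n = \frac{1}{\rho}f_n^{(n)} = r_n$ is a polynomial of degree $\le n$, as desired. □

Now we prove the lemma. If $k = 1$ we simply have

\[ (\sigma\rho r)' = (\sigma\rho)' r + \sigma\rho r' = \tau\rho r + \sigma\rho r' \ \text{ by (4.4.2)} \ = \rho(\tau r + \sigma r'), \tag{4.4.13} \]

as claimed.

Therefore assume $k \ge 2$.

\begin{align*}
f_{k,r}' &= \left(\sigma^{k-1}\,\sigma\rho r\right)' \\
&= (k-1)\sigma^{k-2}\sigma'\sigma\rho r + \sigma^{k-1}\left\{[\sigma\rho]' r + [\sigma\rho]r'\right\} \\
&= (k-1)\sigma^{k-2}\sigma'\sigma\rho r + \sigma^{k-1}\left[\tau\rho r + \sigma\rho r'\right] \quad\text{by equation (4.4.2)} \\
&= (k-1)\sigma^{k-1}\sigma'\rho r + \sigma^{k-1}\rho\left[\tau r + \sigma r'\right] \\
&= \sigma^{k-1}\rho\left\{(k-1)\sigma' r + \tau r + \sigma r'\right\}, \tag{4.4.14}
\end{align*}

as desired.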
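The proof of Proposition 4.5 doubles as an algorithm: starting from $r_0 = 1$, Lemma 4.2 gives $r_{j+1} = (n - j - 1)\sigma' r_j + \tau r_j + \sigma r_j'$, and $y_n = r_n$, so $y_n$ can be computed from $\sigma$ and $\tau$ alone without ever writing down $\rho$. The sketch below implements this recursion with exact rational coefficient lists (lowest degree first) and evaluates the left side of (4.4.1) at $\lambda = \lambda_n$; the function names are illustrative.

```python
from fractions import Fraction

def trim(p):
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim([a + b for a, b in zip(p, q)])

def mul(p, q):
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def diff(p, k=1):
    p = list(p)
    for _ in range(k):
        p = [i * c for i, c in enumerate(p)][1:] or [0]
    return trim(p)

def rodrigues_polynomial(sigma, tau, n):
    """y_n = (1/rho) d^n/dx^n (sigma^n rho), via the recursion from Lemma 4.2."""
    r = [Fraction(1)]
    dsigma = diff(sigma)
    for j in range(n):
        r = add(add(mul([Fraction(n - j - 1)], mul(dsigma, r)), mul(tau, r)),
                mul(sigma, diff(r)))
    return r

def residual(sigma, tau, n):
    """sigma*y'' + tau*y' + lambda_n*y for y = y_n; Theorem 4.2 says this is zero."""
    y = rodrigues_polynomial(sigma, tau, n)
    tau1 = tau[1] if len(tau) > 1 else 0
    sigma2 = 2 * sigma[2] if len(sigma) > 2 else 0
    lam = -n * tau1 - Fraction(n * (n - 1), 2) * sigma2
    return add(add(mul(sigma, diff(y, 2)), mul(tau, diff(y))), mul([lam], y))

hermite_y2 = rodrigues_polynomial([1], [0, -2], 2)          # sigma = 1, tau = -2x
legendre_y2 = rodrigues_polynomial([1, 0, -1], [0, -2], 2)  # sigma = 1 - x^2, tau = -2x
```

For $\sigma = 1$, $\tau = -2x$ this returns $4x^2 - 2$, and for $\sigma = 1 - x^2$, $\tau = -2x$ it returns $12x^2 - 4$, a constant multiple of $P_2$.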

Given condition (4.4.2) we note that equation (4.4.1) ($\sigma y'' + \tau y' + \lambda y = 0$) holds $\iff$

\[ \frac{1}{\rho}\frac{d}{dx}\left[\rho\sigma y'\right] + \lambda y = 0 \tag{4.4.15} \]

holds. (4.4.15) is called the adjoint form of (4.4.1).



4.5 Examples of the Reduction Technique 

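Consider the time-independent Schrödinger equation for the harmonic oscillator; see Section 3.2, Chapter 3:

\[ \frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2}\left(E - \frac{\beta x^2}{2}\right)\psi = 0, \qquad V(x) = \frac{\beta x^2}{2}, \quad \beta = 4\pi^2\nu^2 m. \tag{4.5.1} \]

This has the form (4.2.1):

\[ u''(x) + \frac{\tilde{\tau}(x)}{\sigma(x)}u'(x) + \frac{\tilde{\sigma}(x)}{\sigma^2(x)}u(x) = 0 \tag{4.5.2} \]

on $\mathbb{R}$ for $u = \psi$, $\tilde{\tau} = 0$, $\sigma = 1$, $\tilde{\sigma}(x) = \theta\left(E - \frac{\beta x^2}{2}\right)$, where $\theta := \frac{2m}{\hbar^2}$. We employ the notation (4.3.4)-(4.3.8).

As $\tilde{\tau}, \sigma' = 0$, $f_\kappa := \frac{(\tilde{\tau} - \sigma')^2}{4} + \kappa\sigma - \tilde{\sigma} = \kappa - \tilde{\sigma}$ for $\kappa \in \mathbb{R}$: $f_\kappa(x) = \kappa - \theta E + \frac{\theta\beta}{2}x^2 = ax^2 + bx + c$ for $a := \frac{\theta\beta}{2} = \frac{m\beta}{\hbar^2} > 0$, $b = 0$, $c = \kappa - \theta E \implies \Delta(\kappa) := b^2 - 4ac = -4a(\kappa - \theta E)$. Thus $\Delta(\kappa) = 0 \iff \kappa - \theta E = 0 \iff c = 0$, $\kappa = \theta E$.

$p(x) := \sqrt{a}\,x \implies p(x)^2 = ax^2 = f_\kappa(x)$ (as $\kappa - \theta E = 0$). $\pi^{\pm} := \frac{\sigma' - \tilde{\tau}}{2} \pm p = \pm p$. If we choose the minus here (we also choose the positive square root of $a$) then $\tau := \tilde{\tau} + 2\pi^{-} = -2p$ will have a negative derivative: $p(x) = \sqrt{a}\,x \implies \tau(x) = -2\sqrt{a}\,x \implies \tau'(x) = -2\sqrt{a}$; compare remarks preceding Lemma 4.1. Take (as in (4.3.8))

\[ \phi(x) = \exp\left(\int\frac{\pi^{-}(x)}{\sigma(x)}\,dx\right) = e^{-\sqrt{a}\,x^2/2} \implies \frac{\phi'(x)}{\phi(x)} = \frac{\pi^{-}(x)}{\sigma(x)} \quad\text{(which is (4.3.7))}. \tag{4.5.3} \]

Also $h = 0$ satisfies $h' = \tilde{\tau}/\sigma$ (since $\tilde{\tau} = 0$). Thus we can take $\rho(x) = \frac{\phi^2(x)}{\sigma(x)} = e^{-\sqrt{a}\,x^2}$ in Proposition 4.4: $(\rho\sigma)' = \rho\tau$. For $n \ge 0$ an integer, $\lambda_n := -n\tau' - \frac{1}{2}n(n-1)\sigma'' = 2n\sqrt{a}$. Let

\[ y_n(x) := \frac{1}{\rho(x)}\frac{d^n}{dx^n}\left(\sigma^n(x)\rho(x)\right) = e^{\sqrt{a}\,x^2}\frac{d^n}{dx^n}e^{-\sqrt{a}\,x^2}. \tag{4.5.4} \]

Then, by Theorem 4.2, we know that $y_n$ is a solution of $\sigma y'' + \tau y' + \lambda_n y = 0$ and thus for $\lambda = \lambda_n$ (where $\lambda := \kappa + (\pi^{-})' = \theta E - \sqrt{a}$) we obtain a solution $\psi_n := \phi y_n$ of $u'' + \frac{\tilde{\tau}}{\sigma}u' + \frac{\tilde{\sigma}}{\sigma^2}u = 0$, by Theorem 4.1. The condition $\lambda = \lambda_n$ is $\theta E - \sqrt{a} = 2n\sqrt{a}$, or $E = E_n := \frac{1}{\theta}(2n + 1)\sqrt{a} = \left(n + \frac{1}{2}\right)\frac{2\sqrt{a}}{\theta}$.

For $\beta = 4\pi^2\nu^2 m$ as above and $\hbar := \frac{h}{2\pi}$ we have $\sqrt{a} = \frac{2\pi\nu m}{\hbar}$, and we obtain $E = E_n = \left(n + \frac{1}{2}\right)\nu h$. Thus for $E = E_n$ we have the solution $\psi = \psi_n := \phi y_n$ of equation (4.5.1):

\[ \frac{d^2\psi_n}{dx^2} + \frac{2m}{\hbar^2}\left(E_n - \frac{\beta x^2}{2}\right)\psi_n = 0 \tag{4.5.5} \]

on $\mathbb{R}$. Let $H_n$ be the $n$th Hermite polynomial of (4.2.13): $H_n(x) := (-1)^n e^{x^2}\frac{d^n}{dx^n}e^{-x^2}$ on $\mathbb{R}$. If $g(x) := e^{-x^2}$ and $g_b(x) := e^{-bx^2}$ for $b > 0$, then $g_b(x) = g(\sqrt{b}\,x) \implies g_b'(x) = g'(\sqrt{b}\,x)\sqrt{b}$, $g_b''(x) = g''(\sqrt{b}\,x)(\sqrt{b})^2$, $\dots$, $g_b^{(n)}(x) = g^{(n)}(\sqrt{b}\,x)(\sqrt{b})^n$. Choose $b = \sqrt{a}$:

\[ y_n(x) = e^{bx^2}g_b^{(n)}(x) = (\sqrt{b})^n e^{bx^2}g^{(n)}(\sqrt{b}\,x). \tag{4.5.6} \]

But

\[ H_n(x) = (-1)^n e^{x^2}g^{(n)}(x) \implies H_n(\sqrt{b}\,x) = (-1)^n e^{bx^2}g^{(n)}(\sqrt{b}\,x); \tag{4.5.7} \]

i.e.,

\[ y_n(x) = (-1)^n(\sqrt{b})^n H_n(\sqrt{b}\,x) \implies \psi_n(x) = \phi(x)y_n(x) = e^{-bx^2/2}(-1)^n(\sqrt{b})^n H_n(\sqrt{b}\,x) \tag{4.5.8} \]

for $b = \frac{2\pi\nu m}{\hbar} = \frac{4\pi^2\nu m}{h}$.

We have thus obtained the energy levels $E = E_n = \left(n + \frac{1}{2}\right)\nu h$ for the quantized harmonic oscillator (as advertised in Section 3.2 of Chapter 3) and the corresponding wave functions $\psi_n$ given by (4.5.8). The $\psi_n$ will be normalized later. We also see that the quantum condition for the energies is precisely the mathematical condition $\lambda = \lambda_n$ of Theorem 4.2.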
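The result can be sanity-checked numerically: with $E = (n + \frac{1}{2})\nu h$, the function $e^{-bx^2/2}H_n(\sqrt{b}\,x)$ should satisfy (4.5.1) up to discretization error. The sketch below uses a central second difference and the standard three-term recurrence $H_{k+1} = 2xH_k - 2kH_{k-1}$ (a well-known property of the Hermite polynomials, not derived in this section). The default units $m = \hbar = 1$, $\nu = 1/2\pi$ are an arbitrary choice that makes $b = 1$; the constant factor $(-1)^n(\sqrt{b})^n$ of (4.5.8) is dropped since it does not affect the equation.

```python
import math

def hermite(n, x):
    """H_n(x) via the standard recurrence H_{k+1} = 2x H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h

def psi(n, x, b):
    """exp(-b x^2 / 2) H_n(sqrt(b) x): the wave function (4.5.8) up to a constant."""
    return math.exp(-b * x * x / 2) * hermite(n, math.sqrt(b) * x)

def schrodinger_residual(n, x, m=1.0, hbar=1.0, nu=1.0 / (2 * math.pi), step=1e-3):
    """Finite-difference value of psi'' + (2m/hbar^2)(E_n - beta x^2/2) psi at x."""
    beta = 4 * math.pi ** 2 * nu ** 2 * m            # V(x) = beta x^2 / 2
    b = 2 * math.pi * nu * m / hbar
    energy = (n + 0.5) * nu * (2 * math.pi * hbar)   # E_n = (n + 1/2) nu h with h = 2 pi hbar
    second = (psi(n, x + step, b) - 2 * psi(n, x, b) + psi(n, x - step, b)) / step ** 2
    return second + (2 * m / hbar ** 2) * (energy - beta * x * x / 2) * psi(n, x, b)

worst = max(abs(schrodinger_residual(n, x))
            for n in range(5) for x in (-1.5, -0.5, 0.0, 0.7, 1.2))
```

The quantity `worst` is only expected to be small relative to the size of $\psi_n$, not exactly zero, because the second derivative is approximated.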



Example 

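Consider the differential equation

\[ \frac{1}{r^2}\frac{d}{dr}\left(r^2F_1'(r)\right) + \frac{a}{r^2}F_1(r) + C\left(E_1 - \frac{c}{r}\right)F_1(r) = 0 \tag{4.5.9} \]

on $\mathbb{R}^+ = \{r \in \mathbb{R} \mid r > 0\}$ for $a, C, c, E_1 \in \mathbb{R}$. We assume $-a = l(l+1)$ for $l \ge 0$ an integer and we assume $C > 0$, $c, E_1 < 0$. Write equation (4.5.9) as

\[ F_1''(r) + \frac{2F_1'(r)}{r} + \frac{\left[a + r^2C\left(E_1 - \frac{c}{r}\right)\right]F_1(r)}{r^2} = 0. \tag{4.5.10} \]

Thus $\sigma(r) = r$, $\tilde{\tau}(r) = 2$, $\tilde{\sigma}(r) = a + r^2C\left(E_1 - \frac{c}{r}\right)$. From (4.3.4)

\begin{align*}
f_\kappa(r) &:= \frac{(\tilde{\tau} - \sigma')^2}{4}(r) + \kappa\sigma(r) - \tilde{\sigma}(r) \\
&= \frac{1}{4} + \kappa r - a - r^2C\left(E_1 - \frac{c}{r}\right) \\
&= C(-E_1)r^2 + (\kappa + Cc)r + \frac{1}{4} - a \implies \\
\Delta(\kappa) &= (\kappa + Cc)^2 + 4CE_1\left(\frac{1}{4} - a\right), \tag{4.5.11}
\end{align*}

which we set equal to 0: $(\kappa + Cc)^2 = 4C(-E_1)\left(\frac{1}{4} + l(l+1)\right)$ (where the right-hand side is positive by the assumptions $E_1 < 0$, $C > 0$, $l \ge 0$). Let $\nu := 4C(-E_1)\left(\frac{1}{4} + l(l+1)\right)$ so that $\kappa = -Cc \pm \sqrt{\nu}$, with $\frac{1}{4} + l(l+1) = \left(l + \frac{1}{2}\right)^2$; i.e., $\sqrt{\nu} = \sqrt{\mu}(2l+1)$ for $\mu := C(-E_1) \implies \kappa = -Cc \pm \sqrt{\mu}(2l+1)$. We get

\[ f_\kappa(r) = \mu r^2 \pm \sqrt{\nu}\,r + \left(l + \tfrac{1}{2}\right)^2 = \mu r^2 \pm \sqrt{\mu}(2l+1)r + \left(l + \tfrac{1}{2}\right)^2, \tag{4.5.12} \]

so that $p(r) := \sqrt{\mu}\,r \pm \left(l + \frac{1}{2}\right)$ satisfies $p^2 = f_\kappa$. By (4.3.5), (4.3.6)

\begin{align*}
\pi(r) = \pi^{\pm}(r) &:= \frac{\sigma' - \tilde{\tau}}{2}(r) \pm p(r) = -\frac{1}{2} \pm \left[\sqrt{\mu}\,r \pm \left(l + \tfrac{1}{2}\right)\right], \tag{4.5.13} \\
\tau(r) &:= \tilde{\tau}(r) + 2\pi(r) = 2 + 2\pi(r).
\end{align*}

If we make the choices $\kappa = -Cc - \sqrt{\nu}$, $\pi = \pi^{-}$, i.e., $\pi(r) = -\frac{1}{2} - p(r) = -\frac{1}{2} - \left[\sqrt{\mu}\,r - \left(l + \frac{1}{2}\right)\right] = -\sqrt{\mu}\,r + l$, then $\tau(r) = -2\sqrt{\mu}\,r + 2l + 2$ will satisfy $\tau' < 0$, $\tau(r) = 0$ for $r = \frac{l+1}{\sqrt{\mu}} \in (0, \infty)$; compare Lemma 4.1. Then $\lambda := \kappa + \pi' = -Cc - \sqrt{\nu} - \sqrt{\mu} = -Cc - \sqrt{\mu}(2l+1) - \sqrt{\mu} = -Cc - 2(l+1)\sqrt{\mu}$, $\lambda_p := -p\tau' - \frac{p(p-1)}{2}\sigma'' = 2p\sqrt{\mu}$, $p = 0, 1, 2, 3, \dots$ (see (4.4.12)). Thus

\begin{align*}
\lambda = \lambda_p &\iff -Cc = 2(l + p + 1)\sqrt{\mu} \\
&\iff C^2c^2 = 4(l + p + 1)^2\mu \quad\text{(since } |Cc| = -Cc \text{ for } C > 0,\ c < 0) \\
&\iff E_1 = \frac{-Cc^2}{4(p + l + 1)^2}, \quad\text{as } \mu = -CE_1. \tag{4.5.14}
\end{align*}

By (4.3.8) and Proposition 4.4

\begin{align*}
\phi(r) &= \exp\left(\int\frac{\pi}{\sigma}(r)\,dr\right) = \exp\left(\int\left[-\sqrt{\mu} + \frac{l}{r}\right]dr\right) = r^l\exp(-\sqrt{\mu}\,r), \\
h(r) &= \int\frac{\tilde{\tau}}{\sigma}(r)\,dr = \int\frac{2}{r}\,dr = 2\log r \implies \\
\rho(r) &= \frac{\phi^2(r)}{\sigma(r)}\exp(h(r)) = \frac{r^{2l}\exp(-2\sqrt{\mu}\,r)}{r}\,r^2 = r^{2l+1}\exp\left(-2\sqrt{C(-E_1)}\,r\right) \quad\text{on } \mathbb{R}^+. \tag{4.5.15}
\end{align*}

Then

\[ y_p(r) := \frac{1}{\rho(r)}\frac{d^p}{dr^p}\left(\sigma^p(r)\rho(r)\right) = r^{-(2l+1)}e^{\alpha r}\frac{d^p}{dr^p}\left[r^{p+2l+1}\exp(-\alpha r)\right] \tag{4.5.16} \]

for $\alpha := 2\sqrt{C(-E_1)} = 2\sqrt{\mu}$. On the other hand, for $p \ge 0$ an integer we have the Laguerre polynomials $L_p^{\beta}$ of degree $p$ given in (4.2.12):

\[ L_p^{\beta}(x) = \frac{e^x x^{-\beta}}{p!}\frac{d^p}{dx^p}\left(e^{-x}x^{p+\beta}\right) \quad\text{on } \mathbb{R}^+. \tag{4.5.17} \]

Let $f(x) := e^{-x}x^{p+\beta}$ on $\mathbb{R}^+$, $g(x) := f(\alpha x)$, $\alpha \in \mathbb{R}^+$. Then $g^{(n)}(x) = f^{(n)}(\alpha x)\alpha^n \implies f^{(p)}(\alpha x) = \alpha^{-p}g^{(p)}(x)$. That is,

\begin{align*}
L_p^{\beta}(x) &= \frac{e^x x^{-\beta}}{p!}f^{(p)}(x) \implies \\
L_p^{\beta}(\alpha x) &= \frac{e^{\alpha x}(\alpha x)^{-\beta}}{p!}f^{(p)}(\alpha x) = \frac{e^{\alpha x}(\alpha x)^{-\beta}}{p!}\,\alpha^{-p}\frac{d^p}{dx^p}\left[e^{-\alpha x}(\alpha x)^{p+\beta}\right] = \frac{e^{\alpha x}x^{-\beta}}{p!}\frac{d^p}{dx^p}\left(e^{-\alpha x}x^{p+\beta}\right). \tag{4.5.18}
\end{align*}

We can therefore write $y_p(r) = p!\,L_p^{2l+1}(\alpha r)$. By Theorem 4.2, $y_p$ is a solution of the canonical form $\sigma y'' + \tau y' + \lambda y = 0$, equation (4.4.1), provided $\lambda = \lambda_p$, i.e., $E_1 = \frac{-Cc^2}{4(p + l + 1)^2}$, in which case we obtain the solution $F_1 = u_p := \phi y_p$ of equation (4.5.10), by Theorem 4.1:

\[ u_p(r) = r^l\exp\left(-\sqrt{C(-E_1)}\,r\right)p!\,L_p^{2l+1}\left(2\sqrt{C(-E_1)}\,r\right) \quad\text{for } E_1 = \frac{-Cc^2}{4(p + l + 1)^2}. \tag{4.5.19} \]

Now specialize the parameters; choose $C = \frac{2\mu}{\hbar^2}$ where $\mu = \frac{m_1 m_2}{m_1 + m_2}$ is the reduced mass of two particles (from here on $\mu$ denotes this mass and no longer the quantity $C(-E_1)$ used above), say a nucleus of mass $m_1$ and an electron of mass $m_2$; choose $c = q_1 q_2$ where $q_1 = Ze$ for $-e$ the electron charge, $q_2 = -e$. Here $Z$ = number of protons in the nucleus (also called the atomic number). Thus $c = -Ze^2 < 0$. Given an integer $n \ge l + 1$ take $p = n - (l + 1) \ge 0$ (an integer). Set

\[ E^{(n)} := \frac{-Cc^2}{4(p + l + 1)^2} \ \text{(see equation (4.5.19))} \ = -\frac{2\mu}{\hbar^2}\frac{Z^2e^4}{4n^2} = -\frac{\mu Z^2e^4}{2\hbar^2n^2} = -\frac{2\pi^2\mu Z^2e^4}{h^2n^2}, \tag{4.5.20} \]

and set

\[ \alpha_n := 2\sqrt{C(-E^{(n)})} = \frac{2\mu Ze^2}{\hbar^2 n}, \qquad \rho_n(r) := \alpha_n r \quad\text{on } \mathbb{R}^+. \tag{4.5.21} \]

By the above discussion we obtain solutions $u_{nl}$ of equation (4.5.9) (for $C = \frac{2\mu}{\hbar^2}$, $a = -l(l+1)$, $c = -Ze^2$) given by

\[ u_{nl}(r) = r^l\exp(-\alpha_n r/2)\,[n - (l + 1)]!\,L_{n-(l+1)}^{2l+1}(\rho_n(r)) \tag{4.5.22} \]

provided $E_1 = E^{(n)}$. The solutions $u_{nl}$ will play a vital role in the next chapter.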
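The quantization rule (4.5.14) and its specialization (4.5.20) reduce to a one-line formula. As a minimal sketch, the code below evaluates it in units where $\mu = \hbar = e = 1$ (an assumed unit choice made here for convenience, so that $C = 2$ and $c = -Z$); the function names are illustrative.

```python
from fractions import Fraction

def energy(C, c, p, l):
    """E_1 = -C c^2 / (4 (p + l + 1)^2), cf. (4.5.14)."""
    return -C * c * c / (4 * (p + l + 1) ** 2)

def hydrogen_level(n, l, Z=1):
    """E^(n) of (4.5.20) in units with mu = hbar = e = 1, so C = 2 and c = -Z."""
    if not 0 <= l <= n - 1:
        raise ValueError("need 0 <= l <= n - 1")
    return energy(Fraction(2), Fraction(-Z), n - (l + 1), l)

ground = hydrogen_level(1, 0)   # equals -1/2 in these units
first_excited = [hydrogen_level(2, l) for l in (0, 1)]
```

Since $p + l + 1 = n$, the value depends on $n$ only; the two entries of `first_excited` coincide, which reflects the fact that (4.5.20) contains no $l$.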



4.6 Examples: Hermite, Legendre, Laguerre, Jacobi 

In the Hermite case (see (4.2.4))

\[ \sigma(x) = 1, \qquad \tau(x) = -2x \tag{4.6.1} \]

$\implies \lambda_n = 2n$; $\tau_n := \tau + n\sigma' = \tau$, $\rho(x) = e^{-x^2}$,

and therefore $y_n(x) = e^{x^2}\frac{d^n}{dx^n}e^{-x^2} = (-1)^n H_n(x)$. Corollary 4.1 gives $\lambda_n y_{n+1} = \lambda_n\tau_n y_n - n\sigma\tau_n' y_n'$ for $n \ge 1$. That is, $2ny_{n+1}(x) = 2n(-2x)y_n(x) - n(-2)y_n'(x)$, or $y_{n+1}(x) = -2xy_n(x) + y_n'(x)$ for $n \ge 1$, which we multiply by $(-1)^{n+1}$: $H_n = (-1)^n y_n \implies$

\[ H_{n+1}(x) = 2xH_n(x) - H_n'(x): \tag{4.6.2} \]

Corollary 4.2.

\[ H_{n+1}(x) - 2xH_n(x) + H_n'(x) = 0 \quad\text{for } n \ge 1; \]

this also holds for $n = 0$. Also $H_n$ is a solution of equation (4.2.4) for $\lambda = 2n$ there.
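Corollary 4.2 gives a direct way to generate the Hermite polynomials from $H_0 = 1$. The following sketch does this with integer coefficient lists (lowest degree first) and also evaluates the left side of (4.2.4) with $\lambda = 2n$; the function names are illustrative.

```python
def hermite_polys(count):
    """First `count` Hermite polynomials from H_{n+1} = 2x H_n - H_n'."""
    polys = [[1]]
    for _ in range(count - 1):
        h = polys[-1]
        two_x_h = [0] + [2 * c for c in h]           # coefficients of 2x * H_n
        dh = [k * c for k, c in enumerate(h)][1:]    # coefficients of H_n'
        dh += [0] * (len(two_x_h) - len(dh))
        polys.append([a - b for a, b in zip(two_x_h, dh)])
    return polys

def hermite_ode_residual(h, n):
    """Coefficients of y'' - 2x y' + 2n y for y given by the list h."""
    d1 = [k * c for k, c in enumerate(h)][1:]
    d2 = [k * c for k, c in enumerate(d1)][1:]
    res = [0] * len(h)
    for k, c in enumerate(d2):
        res[k] += c
    for k, c in enumerate(d1):
        res[k + 1] -= 2 * c
    for k, c in enumerate(h):
        res[k] += 2 * n * c
    return res

H = hermite_polys(6)
```

The first entries reproduce $H_1(x) = 2x$ and $H_2(x) = 4x^2 - 2$ as listed after (4.2.13).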

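In the Legendre case (see (4.2.3))

\[ \sigma(x) = 1 - x^2, \qquad \tau(x) = -2x \tag{4.6.3} \]

$\implies \lambda_n = n(n+1)$; $\tau_n(x) = -2(n+1)x$, $\rho(x) = 1$,

and therefore $y_n = \frac{d^n}{dx^n}(1 - x^2)^n$. Moreover

\[ P_n(x) = \frac{1}{2^n n!}\frac{d^n}{dx^n}(x^2 - 1)^n = \frac{(-1)^n}{2^n n!}\frac{d^n}{dx^n}\sigma(x)^n = \frac{(-1)^n}{2^n n!}y_n(x) \quad\text{(as } \rho(x) = 1). \]

Corollary 4.1, namely $\lambda_n y_{n+1} = \lambda_n\tau_n y_n - n\sigma\tau_n' y_n'$, becomes (after division by $\lambda_n$)

\[ y_{n+1}(x) = -2(n+1)xy_n(x) + 2(1 - x^2)y_n'(x) \quad\text{for } n \ge 1. \tag{4.6.4} \]

For $c_n := \frac{(-1)^n}{2^n n!}$, $c_{n+1} = \frac{(-1)^{n+1}}{2^{n+1}(n+1)!} = \frac{-1}{2(n+1)}c_n$. Thus multiply the formula for $y_{n+1}(x)$ by $c_{n+1}$ to get

\[ P_{n+1}(x) = xP_n(x) - \frac{(1 - x^2)}{n+1}P_n'(x): \tag{4.6.5} \]

Corollary 4.3.

\[ P_{n+1}(x) - xP_n(x) + \frac{(1 - x^2)}{n+1}P_n'(x) = 0 \quad\text{for } n \ge 1; \]

this also holds for $n = 0$. Also $P_n(x)$ is a solution of equation (4.2.3) for $\lambda = n(n+1)$ there.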

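In the Laguerre case (see (4.2.5))

\[ \sigma(x) = x, \qquad \tau(x) = \alpha + 1 - x, \qquad \rho(x) = x^{\alpha}e^{-x}, \quad x > 0, \qquad \lambda_n = n, \tag{4.6.6} \]

so that

\[ y_n(x) = \frac{1}{\rho(x)}\frac{d^n}{dx^n}\left(\sigma^n(x)\rho(x)\right) = x^{-\alpha}e^{x}\frac{d^n}{dx^n}\left(x^{n+\alpha}e^{-x}\right) = n!\,L_n^{\alpha}(x) \]

is a solution of

\[ xy''(x) + (\alpha + 1 - x)y'(x) + ny(x) = 0 \quad\text{on } \mathbb{R}^+, \text{ by Theorem 4.2.} \tag{4.6.7} \]

Also $\sigma'(x) = 1 \implies \tau_n(x) = \tau(x) + n = \alpha - x + n + 1$.

Corollary 4.1, namely $\lambda_n y_{n+1} = \lambda_n\tau_n y_n - n\sigma\tau_n' y_n'$, gives (after division by $n$)

\[ y_{n+1}(x) = (\alpha - x + n + 1)y_n(x) + xy_n'(x) \quad\text{for } n \ge 1, \tag{4.6.8} \]

which we divide by $(n+1)! = (n+1)n!$ to get

\[ L_{n+1}^{\alpha}(x) = \frac{(\alpha - x + n + 1)L_n^{\alpha}(x) + x(L_n^{\alpha})'(x)}{n+1}. \tag{4.6.9} \]

Corollary 4.4. $L_n^{\alpha}$ is a solution of equation (4.2.5) for $\lambda = n$ there. Also

\[ (n+1)L_{n+1}^{\alpha}(x) - (\alpha - x + n + 1)L_n^{\alpha}(x) - x(L_n^{\alpha})'(x) = 0 \quad\text{for } n \ge 1. \]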
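The recurrence of Corollary 4.4 can be used to build $L_2^{\alpha}, L_3^{\alpha}, \dots$ from $L_0^{\alpha} = 1$ and $L_1^{\alpha}(x) = 1 + \alpha - x$. The sketch below does so for a rational $\alpha$, with polynomials as exact coefficient lists (lowest degree first); the function name is illustrative.

```python
from fractions import Fraction

def laguerre_polys(alpha, count):
    """L_0^alpha .. L_{count-1}^alpha via (n+1) L_{n+1} = (alpha - x + n + 1) L_n + x L_n'."""
    alpha = Fraction(alpha)
    polys = [[Fraction(1)], [1 + alpha, Fraction(-1)]]   # L_0 and L_1, as listed after (4.2.12)
    for n in range(1, count - 1):
        L = polys[n]
        dL = [k * c for k, c in enumerate(L)][1:]
        nxt = [Fraction(0)] * (len(L) + 1)
        for k, c in enumerate(L):
            nxt[k] += (alpha + n + 1) * c   # (alpha + n + 1) * L_n
            nxt[k + 1] -= c                 # -x * L_n
        for k, c in enumerate(dL):
            nxt[k + 1] += c                 # x * L_n'
        polys.append([c / (n + 1) for c in nxt])
    return polys[:count]

half = Fraction(1, 2)
L = laguerre_polys(half, 3)   # L[2] should match (1/2)(a+1)(a+2) - (a+2)x + x^2/2 at a = 1/2
L0 = laguerre_polys(0, 4)     # ordinary Laguerre polynomials (alpha = 0)
```

For $\alpha = \frac{1}{2}$ the computed $L_2^{\alpha}$ agrees with the closed form $\frac{1}{2}(\alpha+1)(\alpha+2) - (\alpha+2)x + \frac{x^2}{2}$ given after (4.2.12).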

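We consider next Jacobi polynomials. Define $\sigma(x) = 1 - x^2$, $\tau(x) = -(\alpha + \beta + 2)x + \beta - \alpha$, $\rho(x) = (1 - x)^{\alpha}(1 + x)^{\beta}$ on $(-1, 1)$ for $\alpha, \beta \in \mathbb{R}$ fixed. Therefore $\rho(x) \ne 0\ \forall x \in (-1, 1)$. Usually we will take $\alpha, \beta > -1$. By straightforward differentiation one has

Proposition 4.6 (cf. condition (4.4.2)).

\[ (\rho\sigma)' = \rho\tau \ \text{ on } (-1, 1) \quad\text{and}\quad \lambda_n := -n\tau' - \frac{n(n-1)}{2}\sigma'' = n(\alpha + \beta + n + 1). \]

Let $y_n(x) := \frac{1}{\rho(x)}\frac{d^n}{dx^n}\left(\sigma^n(x)\rho(x)\right)$. That is, $y_n(x) = (1 - x)^{-\alpha}(1 + x)^{-\beta}\frac{d^n}{dx^n}\left[(1 - x)^{n+\alpha}(1 + x)^{n+\beta}\right]$. Then, given Proposition 4.6, we know by Proposition 4.5 and Theorem 4.2 that $y_n(x)$ is a polynomial in $x$ of degree $\le n$, and that, on $(-1, 1)$, $y_n$ is a solution of

\[ (1 - x^2)y''(x) + [-(\alpha + \beta + 2)x + \beta - \alpha]y'(x) + \mu y(x) = 0 \tag{4.6.10} \]

(i.e., of the equation $\sigma y'' + \tau y' + \mu y = 0$) provided $n(\alpha + \beta + n + 1) = \mu$ (i.e., $\lambda_n = \mu$).

Equation (4.6.10) is Jacobi’s differential equation; see (4.2.10). Recall that $\sigma y'' + \tau y' + \mu y = 0 \iff (\rho\sigma y')' + \mu\rho y = 0$ (= adjoint form). Thus equation (4.6.10) is also written in the adjoint format (see (4.4.15)):

\[ \frac{1}{\rho}\frac{d}{dx}\left[\rho\sigma y'\right] + \mu y = 0; \tag{4.6.11} \]

i.e.,

\[ (1 - x)^{-\alpha}(1 + x)^{-\beta}\frac{d}{dx}\left[(1 - x)^{\alpha+1}(1 + x)^{\beta+1}y'(x)\right] + \mu y(x) = 0 \]

is the format in some books. From (4.2.14) we have the Jacobi polynomials

\[ P_n^{(\alpha,\beta)}(x) = \frac{(-1)^n}{2^n n!}(1 - x)^{-\alpha}(1 + x)^{-\beta}\frac{d^n}{dx^n}\left[(1 - x)^{\alpha+n}(1 + x)^{\beta+n}\right] = \frac{(-1)^n}{2^n n!}y_n(x) \tag{4.6.12} \]

on $(-1, 1)$. We see that $P_n^{(\alpha,\beta)}$ is a polynomial of degree $\le n$ and $P_n^{(\alpha,\beta)}$ is a solution of Jacobi’s differential equation (4.2.10) provided $n(\alpha + \beta + n + 1) = \mu$.

One actually has that $\deg P_n^{(\alpha,\beta)} = n$. Note that $P_n^{(0,0)}(x) = \frac{(-1)^n}{2^n n!}\frac{d^n}{dx^n}(1 - x^2)^n = \frac{1}{2^n n!}\frac{d^n}{dx^n}(x^2 - 1)^n = P_n(x)$, the Legendre polynomial. Similarly for choices of $\alpha$, $\beta$, $P_n^{(\alpha,\beta)}$ reduces to polynomials of Chebyshev, Gegenbauer, and so on.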



4.7 Orthogonality 

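For integers $n, m \ge 0$ define $f_{nm}$ on $U$ in Section 4.4 by

\[ f_{nm} := \sigma(x)\rho(x)\left[y_n(x)y_m'(x) - y_m(x)y_n'(x)\right] = \sigma(x)\rho(x)W(y_n, y_m)(x), \tag{4.7.1} \]

where

\[ W(y_n, y_m)(x) := \begin{vmatrix} y_n(x) & y_m(x) \\ y_n'(x) & y_m'(x) \end{vmatrix}; \]

i.e., $W(y_n, y_m)$ is the Wronskian of $(y_n, y_m)$. Then

\begin{align*}
f_{nm}' &= \sigma\rho\left[y_n y_m'' + y_n' y_m' - y_m y_n'' - y_m' y_n'\right] + (\sigma\rho)'\left[y_n y_m' - y_m y_n'\right] \\
&= \rho y_n\left[\sigma y_m'' + \tau y_m'\right] - \rho y_m\left[\sigma y_n'' + \tau y_n'\right] \quad\text{by equation (4.4.2)} \\
&= -\lambda_m\rho y_n y_m + \lambda_n\rho y_m y_n \quad\text{by Theorem 4.2.} \tag{4.7.2}
\end{align*}

That is,

\[ f_{nm}' = (\lambda_n - \lambda_m)\rho y_n y_m \quad\text{on } U. \tag{4.7.3} \]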

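4.7.1 Laguerre Orthogonality. Consider the Laguerre case, for example, where we take $U = \mathbb{R}^+ = (0, \infty)$: $\sigma(x) = x$, $\tau(x) = \alpha + 1 - x$, $\rho(x) = x^{\alpha}e^{-x}$, $\lambda_n = n$, for $x > 0$; we take $\alpha > -1$; $y_n = n!\,L_n^{\alpha}$, as noted earlier. By the formula following Corollary 4.1

\[ (\sigma\rho)(x)\,n!\,(L_n^{\alpha})'(x) = -n\frac{d^{n-1}}{dx^{n-1}}\left(\sigma^n(x)\rho(x)\right) = -n\frac{d^{n-1}}{dx^{n-1}}\left(x^{n+\alpha}e^{-x}\right) \tag{4.7.4} \]

for $n \ge 1$. On the other hand, by (4.2.12)

\[ L_{n-1}^{\alpha+1}(x) = \frac{1}{(n-1)!}x^{-\alpha-1}e^{x}\frac{d^{n-1}}{dx^{n-1}}\left(e^{-x}x^{n+\alpha}\right) \tag{4.7.5} \]

for $n \ge 1$, which gives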







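Proposition 4.7.

\[ (L_n^{\alpha})'(x) = -L_{n-1}^{\alpha+1}(x) \quad\text{for } x > 0,\ n \ge 1; \]

hence

\[ (n+1)L_{n+1}^{\alpha}(x) - (\alpha - x + n + 1)L_n^{\alpha}(x) + xL_{n-1}^{\alpha+1}(x) = 0, \tag{4.7.6} \]

by Corollary 4.4.

For $\alpha + 1 > 0$ (i.e., $\alpha > -1$) $\lim_{x \to 0^+}x^{\alpha+1} = 0$ and $\int_0^{\infty}e^{-x}x^{\alpha}\,dx$ converges (to $\Gamma(\alpha + 1)$).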

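For $b > \epsilon > 0$, equations (4.7.1), (4.7.3) give

\[ (n - m)\int_{\epsilon}^{b}x^{\alpha}e^{-x}L_n^{\alpha}(x)L_m^{\alpha}(x)\,dx = x^{\alpha+1}e^{-x}W(L_n^{\alpha}, L_m^{\alpha})(x)\Big]_{\epsilon}^{b} \tag{4.7.7} \]

where we have cancelled $n!\,m!$ on both sides. We may therefore let $\epsilon \to 0^+$, $b \to \infty$ to obtain

\[ (n - m)\int_0^{\infty}x^{\alpha}e^{-x}L_n^{\alpha}(x)L_m^{\alpha}(x)\,dx = 0, \tag{4.7.8} \]

as $W(L_n^{\alpha}, L_m^{\alpha})(x)$ is polynomial in $x$. That is,

Proposition 4.8. For $\alpha > -1$, and integers $n, m \ge 0$

\[ \int_0^{\infty}x^{\alpha}e^{-x}L_n^{\alpha}(x)L_m^{\alpha}(x)\,dx \ \text{ converges,} \]

and is zero for $n \ne m$.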

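By definition

\[ L_n^{\alpha}(x)x^{\alpha}e^{-x} = \frac{1}{n!}\frac{d^n}{dx^n}\left(e^{-x}x^{n+\alpha}\right). \tag{4.7.9} \]

Given any polynomial $p$ we have noted in the argument leading to (4.7.8) that $\alpha > -1 \implies \left[x^{\alpha+1}e^{-x}p(x)\right]_0^{\infty} = 0$. Therefore

\[ \int_0^{\infty}p(x)L_n^{\alpha}(x)x^{\alpha}e^{-x}\,dx = \frac{1}{n!}\int_0^{\infty}p(x)\frac{d^n}{dx^n}\left(e^{-x}x^{\alpha+n}\right)dx \overset{\text{claim}}{=} \frac{(-1)^n}{n!}\int_0^{\infty}p^{(n)}(x)e^{-x}x^{\alpha+n}\,dx, \tag{4.7.10} \]

where the claim is true for $n = 1$ (one integrates by parts), and follows by induction on $n$, using that

\[ \frac{d^{n-1}}{dx^{n-1}}\left(e^{-x}x^{\alpha+n}\right) = e^{-x}\sum_{k=0}^{n-1}(-1)^k\binom{n-1}{k}(n - 1 + \alpha + 1)(n - 2 + \alpha + 1)\cdots(k + 1 + \alpha + 1)\,x^{\alpha+1+k} \tag{4.7.11} \]

[or more simply that $\frac{d^n}{dx^n}\left(e^{-x}x^{n+\alpha}\right) = n!\,e^{-x}x^{\alpha}L_n^{\alpha}(x)$, applied with $n$, $\alpha$ replaced by $n - 1$, $\alpha + 1$], which implies

\[ \left[p(x)\frac{d^{n-1}}{dx^{n-1}}\left(e^{-x}x^{\alpha+n}\right)\right]_0^{\infty} = 0. \tag{4.7.12} \]

By Proposition 4.7 $(L_n^{\alpha})'(x) = -L_{n-1}^{\alpha+1}(x)$ for $n \ge 1$. It follows that $(L_n^{\alpha})^{(k)}(x)$ (the $k$th derivative) $= (-1)^k L_{n-k}^{\alpha+k}(x)$ for $n \ge k$. In particular

\[ (L_n^{\alpha})^{(n)}(x) = (-1)^n \quad\text{and}\quad (L_n^{\alpha})^{(n-1)}(x) = (-1)^{n-1}(\alpha + n - x) \tag{4.7.13} \]

as $L_0^{\beta}(x) = 1$, $L_1^{\beta}(x) = 1 + \beta - x$. For the choice $p = L_n^{\alpha}$ in (4.7.10) we therefore see that since $\int_0^{\infty}e^{-x}x^{\alpha+n}\,dx = \Gamma(n + \alpha + 1)$ we may conclude in conjunction with Proposition 4.8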

Theorem 4.3 (Laguerre Orthogonality).

$$\int_0^\infty x^\alpha e^{-x}L_n^\alpha(x)L_m^\alpha(x)\,dx = \delta_{nm}\frac{\Gamma(n+\alpha+1)}{n!}$$

for $\alpha > -1$, for integers $n, m \ge 0$.
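A minimal numerical illustration of Theorem 4.3, restricted (as an assumption of the sketch) to integer orders $\alpha \ge 0$ so that everything can be done exactly: it uses $\int_0^\infty x^k e^{-x}\,dx = k!$ and the standard explicit coefficients of $L_n^\alpha$. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def laguerre(n, a):
    """Coefficients (lowest degree first) of L_n^a for an integer order a >= 0."""
    return [Fraction((-1) ** k * comb(n + a, n - k), factorial(k))
            for k in range(n + 1)]

def laguerre_inner(n, m, a):
    """Exact integral over (0, oo) of x**a * exp(-x) * L_n^a(x) * L_m^a(x),
    computed term by term from the moments  int x**k exp(-x) dx = k!."""
    total = Fraction(0)
    for i, ci in enumerate(laguerre(n, a)):
        for j, cj in enumerate(laguerre(m, a)):
            total += ci * cj * factorial(a + i + j)
    return total

def expected(n, m, a):
    """Right-hand side of Theorem 4.3; Gamma(n + a + 1) = (n + a)! for integer a."""
    return Fraction(factorial(n + a), factorial(n)) if n == m else Fraction(0)

for a in range(4):
    for n in range(6):
        for m in range(6):
            assert laguerre_inner(n, m, a) == expected(n, m, a)
```

Non-integer $\alpha > -1$ would require the gamma function in place of factorials; the exact-arithmetic shortcut used here does not cover that case.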

In the study of the hydrogen atom (in Chapter 5) we shall need the following 
formula, where as usual we assume a > -1. 

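Theorem 4.4. For $n \ge 0$,

$$\int_0^\infty x^{\alpha+1}e^{-x}\big(L_n^\alpha(x)\big)^2\,dx = \frac{(2n+\alpha+1)\,\Gamma(n+\alpha+1)}{n!}.$$

Proof. Choose $p(x) = xL_n^\alpha(x)$ in (4.7.10) and use (4.7.13): For $n \ge 1$

$$p^{(n)}(x) = x(L_n^\alpha)^{(n)}(x) + n(L_n^\alpha)^{(n-1)}(x) = (-1)^n x + n(-1)^{n-1}(\alpha + n - x) = (-1)^n(n+1)x + (-1)^{n-1}n(\alpha+n)$$ (4.7.14)

for $n \ge 1$ $\Rightarrow$

$$\int_0^\infty xL_n^\alpha(x)L_n^\alpha(x)\,x^\alpha e^{-x}\,dx = \frac{(-1)^n}{n!}\int_0^\infty \big[(-1)^n(n+1)x + (-1)^{n-1}n(\alpha+n)\big]e^{-x}x^{\alpha+n}\,dx$$

$$= \frac{1}{n!}\big[(n+1)\Gamma(n+\alpha+2) - n(\alpha+n)\Gamma(n+\alpha+1)\big] = \frac{1}{n!}\Gamma(n+\alpha+1)(2n+\alpha+1),$$ (4.7.15)

since $\Gamma(z+1) = z\Gamma(z)$. We have assumed $n \ge 1$ in appealing to (4.7.10). But for
$n = 0$ Theorem 4.4 is simply the assertion

$$\int_0^\infty x^{\alpha+1}e^{-x}\,dx = \Gamma(\alpha+2) = (\alpha+1)\Gamma(\alpha+1).$$ (4.7.16)

□

In the preceding proof we chose $p(x) = xL_n^\alpha(x)$. Now choose $p(x) = x^2L_n^\alpha(x)$:

$$\int_0^\infty x^{\alpha+2}e^{-x}\big(L_n^\alpha(x)\big)^2\,dx = \frac{(-1)^n}{n!}\int_0^\infty p^{(n)}(x)\,e^{-x}x^{n+\alpha}\,dx$$ (4.7.17)

by (4.7.10). Now

$$p^{(n)}(x) = x^2(L_n^\alpha)^{(n)}(x) + n\,2x\,(L_n^\alpha)^{(n-1)}(x) + \frac{n(n-1)}{2}\,2\,(L_n^\alpha)^{(n-2)}(x)$$

by the Leibniz rule; cf. (L2) in Section 4.4 of this chapter. That is, using (4.7.13)
and more generally $(L_n^\alpha)^{(k)}(x) = (-1)^kL_{n-k}^{\alpha+k}(x)$, $n \ge k$, we get $p^{(n)}(x) = (-1)^nx^2 - (-1)^n2nx(\alpha + n - x) + n(n-1)(-1)^nL_2^{\alpha+n-2}(x)$, where $L_2^\beta(x) = \frac{1}{2}(\beta+1)(\beta+2) - (\beta+2)x + \frac{x^2}{2}$. After simplification we obtain

$$(-1)^np^{(n)}(x) = \frac{(n+1)(n+2)}{2}x^2 - n(n+1)(n+\alpha)x + \frac{n(n-1)(n+\alpha-1)(n+\alpha)}{2},$$ (4.7.18)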



which we plug into (4.7.17). Using also that

$$\int_0^\infty e^{-x}x^{m+\alpha}\,dx = \Gamma(m+\alpha+1), \quad \alpha > -1,$$ (4.7.19)

we see that the right-hand side of (4.7.17) becomes

$$\frac{1}{n!}\left[\frac{(n+1)(n+2)}{2}\Gamma(n+\alpha+3) - n(n+1)(n+\alpha)\Gamma(n+\alpha+2) + \frac{n(n-1)(n+\alpha-1)(n+\alpha)}{2}\Gamma(n+\alpha+1)\right]$$ (4.7.20)

where $\Gamma(z+1) = z\Gamma(z)$ $\Rightarrow$ $\Gamma(n+\alpha+3) = (n+\alpha+2)(n+\alpha+1)(n+\alpha)\Gamma(n+\alpha)$,
$\Gamma(n+\alpha+2) = (n+\alpha+1)(n+\alpha)\Gamma(n+\alpha)$, $\Gamma(n+\alpha+1) = (n+\alpha)\Gamma(n+\alpha)$ $\Rightarrow$
(by (4.7.17))

$$n!\int_0^\infty x^{\alpha+2}e^{-x}\big(L_n^\alpha(x)\big)^2\,dx = (n+\alpha)\Gamma(n+\alpha)\left[\frac{(n+1)(n+2)}{2}(n+\alpha+2)(n+\alpha+1) - n(n+1)(n+\alpha)(n+\alpha+1) + \frac{n(n-1)(n+\alpha-1)(n+\alpha)}{2}\right].$$ (4.7.21)

The product $2[\ ]$, where $[\ ]$ is the bracket on the right-hand side of (4.7.21),
simplifies to $12n^2 + 12n\alpha + 12n + 2\alpha^2 + 6\alpha + 4$, from which we obtain

Theorem 4.5. For $\alpha > -1$, $n = 0, 1, 2, 3, \ldots$

$$\int_0^\infty x^{\alpha+2}e^{-x}\big(L_n^\alpha(x)\big)^2\,dx = \frac{\Gamma(n+\alpha+1)}{n!}\big[6n^2 + 6n\alpha + 6n + \alpha^2 + 3\alpha + 2\big].$$

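Corollary 4.5. Let $k$, $m$ be integers with $m \ge k \ge 0$. Then

$$\int_0^\infty x^{k+2}e^{-x}\big(L_{m-k}^{k}(x)\big)^2\,dx = \frac{m!}{(m-k)!}\big[6m^2 - 6mk + k^2 + 6m - 3k + 2\big].$$ (4.7.22)

Hence for integers $n$, $l$ with $n \ge 1$, $0 \le l \le n - 1$ we have

$$\int_0^\infty x^{2l+3}e^{-x}\big(L_{n-l-1}^{2l+1}(x)\big)^2\,dx = \frac{(n+l)!}{(n-l-1)!}\big[6n^2 - 2l^2 - 2l\big] = \frac{4n^2(n+l)!}{(n-l-1)!}\left[1 + \frac{1}{2}\left(1 - \frac{l(l+1)}{n^2}\right)\right].$$ (4.7.23)

Proof. Theorem 4.5 implies (4.7.22) immediately, and equation (4.7.23) follows
from (4.7.22) by the choices $k = 2l + 1$, $m = n + l$. □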

Formula (4.7.23) is the key result needed to compute the average distance of 
the electron from the nucleus, as we shall see later. 
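The weighted moments in Theorems 4.4 and 4.5 and in formula (4.7.23) can be cross-checked exactly for integer orders. The following is a sketch under that assumption; the explicit coefficient formula for $L_n^\alpha$ is the standard one, and `weighted_square` is a hypothetical helper name.

```python
from fractions import Fraction
from math import comb, factorial

def laguerre(n, a):
    """Coefficients (lowest degree first) of L_n^a for an integer order a >= 0."""
    return [Fraction((-1) ** k * comb(n + a, n - k), factorial(k))
            for k in range(n + 1)]

def weighted_square(n, a, shift):
    """Exact integral over (0, oo) of x**(a + shift) * exp(-x) * L_n^a(x)**2,
    using  int x**k exp(-x) dx = k!  for integers k >= 0."""
    coeffs = laguerre(n, a)
    return sum(ci * cj * factorial(a + shift + i + j)
               for i, ci in enumerate(coeffs)
               for j, cj in enumerate(coeffs))

for a in range(4):
    for n in range(6):
        scale = Fraction(factorial(n + a), factorial(n))   # Gamma(n+a+1)/n!
        # Theorem 4.4
        assert weighted_square(n, a, 1) == scale * (2 * n + a + 1)
        # Theorem 4.5
        assert weighted_square(n, a, 2) == scale * (
            6 * n * n + 6 * n * a + 6 * n + a * a + 3 * a + 2)

# Formula (4.7.23): order 2l + 1, degree n - l - 1.
for n in range(1, 6):
    for l in range(n):
        lhs = weighted_square(n - l - 1, 2 * l + 1, 2)
        rhs = Fraction(factorial(n + l), factorial(n - l - 1)) * (
            6 * n * n - 2 * l * l - 2 * l)
        assert lhs == rhs
```

The last loop is the case used for the hydrogen atom, where the order $2l+1$ is always an integer, so the integer-order restriction costs nothing there.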




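4.7.2 Jacobi and Legendre Orthogonality. In the Jacobi case take $U = (-1, 1)$:
$\sigma(x) = 1 - x^2$, $\tau(x) = -(\alpha + \beta + 2)x + \beta - \alpha$, $\rho(x) = (1 - x)^\alpha(1 + x)^\beta$,
$\lambda_n = n(\alpha + \beta + n + 1)$; cf. Proposition 4.6; we take $\alpha, \beta > -1$; as before
$y_n = (-1)^n\,n!\,2^n\,P_n^{(\alpha,\beta)}$. By Corollary 4.1,

$$(\sigma\rho)(x)(-1)^n\,n!\,2^n\,\big(P_n^{(\alpha,\beta)}\big)'(x) = -n(\alpha+\beta+n+1)\frac{d^{n-1}}{dx^{n-1}}\big[(1-x^2)^n(1-x)^\alpha(1+x)^\beta\big].$$ (4.7.24)

On the other hand by definition (4.2.14)

$$P_{n-1}^{(\alpha+1,\beta+1)}(x) = \frac{(-1)^{n-1}}{2^{n-1}(n-1)!}(1-x)^{-\alpha-1}(1+x)^{-\beta-1}\frac{d^{n-1}}{dx^{n-1}}\big[(1-x)^{\alpha+n}(1+x)^{\beta+n}\big]$$ (4.7.25)

for $n \ge 1$, which gives

Proposition 4.9.

$$\big(P_n^{(\alpha,\beta)}\big)'(x) = \frac{\alpha+\beta+n+1}{2}\,P_{n-1}^{(\alpha+1,\beta+1)}(x)$$

on $(-1, 1)$ for $n \ge 1$.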

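For $\alpha, \beta > -1$, $\int_{-1}^{1}(1-x)^\alpha(1+x)^\beta\,dx$ converges (to $2^{\alpha+\beta+1}B(\alpha+1, \beta+1)$, for
$B$ = the beta function). It follows that

$$\int_{-1}^{1}\rho(x)P_n^{(\alpha,\beta)}(x)P_m^{(\alpha,\beta)}(x)\,dx \quad \text{converges}.$$ (4.7.26)

By equations (4.7.1), (4.7.3)

$$\big[n(\alpha+\beta+n+1) - m(\alpha+\beta+m+1)\big]\int_{-1+\epsilon}^{1-\epsilon}P_n^{(\alpha,\beta)}(x)P_m^{(\alpha,\beta)}(x)\rho(x)\,dx = \Big[(1-x)^{\alpha+1}(1+x)^{\beta+1}\,W\big(P_n^{(\alpha,\beta)}, P_m^{(\alpha,\beta)}\big)(x)\Big]_{-1+\epsilon}^{1-\epsilon}$$ (4.7.27)

for $0 < \epsilon < 1$. Let $\epsilon \to 0^+$; the right-hand side of (4.7.27) approaches 0 since
$\alpha, \beta > -1$. Also if $n \ne m$, then $n(\alpha+\beta+n+1) \ne m(\alpha+\beta+m+1)$; in fact
$n(\alpha+\beta+n+1) - m(\alpha+\beta+m+1) = (n-m)[\alpha+\beta+n+m+1]$. That is,

Proposition 4.10. For $\alpha, \beta > -1$, and integers $n, m \ge 0$,

$$\int_{-1}^{1}(1-x)^\alpha(1+x)^\beta P_n^{(\alpha,\beta)}(x)P_m^{(\alpha,\beta)}(x)\,dx \quad \text{converges},$$

and is zero for $n \ne m$.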

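Similar to equation (4.7.10) one has by induction on $n$ (for $\alpha, \beta > -1$)

$$\int_{-1}^{1}p(x)P_n^{(\alpha,\beta)}(x)(1-x)^\alpha(1+x)^\beta\,dx = \frac{(-1)^n}{2^n n!}\int_{-1}^{1}p(x)\frac{d^n}{dx^n}\big[(1-x)^{\alpha+n}(1+x)^{\beta+n}\big]\,dx$$

$$= \frac{(-1)^n}{2^n n!}(-1)^n\int_{-1}^{1}p^{(n)}(x)\big[(1-x)^{\alpha+n}(1+x)^{\beta+n}\big]\,dx$$ (4.7.28)

for any polynomial $p$. Choose $p = P_n^{(\alpha,\beta)}$ so that by Proposition 4.9

$$p^{(n)}(x) = \frac{(\alpha+\beta+n+1)(\alpha+\beta+n+2)\cdots(\alpha+\beta+n+n)}{2^n}\,P_0^{(\alpha+n,\beta+n)}(x) = \frac{(\alpha+\beta+n+1)(\alpha+\beta+n+2)\cdots(\alpha+\beta+n+n)}{2^n},$$ (4.7.29)

which implies by (4.7.28)

$$\int_{-1}^{1}\big[P_n^{(\alpha,\beta)}(x)\big]^2(1-x)^\alpha(1+x)^\beta\,dx = \frac{(\alpha+\beta+n+1)\cdots(\alpha+\beta+n+n)}{2^{2n}n!}\int_{-1}^{1}(1-x)^{\alpha+n}(1+x)^{\beta+n}\,dx$$

$$= \frac{(\alpha+\beta+n+1)\cdots(\alpha+\beta+n+n)}{2^{2n}n!}\cdot 2^{\alpha+\beta+2n+1}B(\alpha+n+1, \beta+n+1).$$ (4.7.30)

Now $\Gamma(z+m) = z(z+1)\cdots(z+m-1)\,\Gamma(z)$ for $m = 1, 2, 3, \ldots$, $z \ne 0, -1, -2, -3, \ldots$, again as $\Gamma(z+1) = z\,\Gamma(z)$. Choose $z = \alpha+\beta+n+1$, $m = n+1$:
$\Gamma(\alpha+\beta+2n+2) = (\alpha+\beta+n+1)(\alpha+\beta+n+2)\cdots(\alpha+\beta+n+n)(\alpha+\beta+n+n+1)\,\Gamma(\alpha+\beta+n+1)$. Since $B(\alpha+n+1, \beta+n+1) = \Gamma(\alpha+n+1)\Gamma(\beta+n+1)/\Gamma(\alpha+\beta+2n+2)$, given Proposition 4.10 and (4.7.30) the following
conclusion is valid:

Theorem 4.6 (Jacobi Orthogonality).

$$\int_{-1}^{1}P_n^{(\alpha,\beta)}(x)P_m^{(\alpha,\beta)}(x)(1-x)^\alpha(1+x)^\beta\,dx = \frac{2^{\alpha+\beta+1}}{n!}\,\frac{\Gamma(\alpha+n+1)\,\Gamma(\beta+n+1)}{(\alpha+\beta+2n+1)\,\Gamma(\alpha+\beta+n+1)}\,\delta_{nm}$$

for $\alpha, \beta > -1$, and integers $n, m \ge 0$.

Corollary 4.6 (Legendre Orthogonality).

$$\int_{-1}^{1}P_n(x)P_m(x)\,dx = \frac{2}{2n+1}\,\delta_{nm}$$

for integers $n, m \ge 0$.

The corollary follows since $P_n = P_n^{(0,0)}$.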
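Theorem 4.6 can be illustrated with exact arithmetic when $\alpha$ and $\beta$ are non-negative integers, since the weight is then a polynomial. The sketch below assumes the standard explicit sum $P_n^{(\alpha,\beta)}(x) = \sum_{s=0}^{n}\binom{n+\alpha}{n-s}\binom{n+\beta}{s}\big(\tfrac{x-1}{2}\big)^s\big(\tfrac{x+1}{2}\big)^{n-s}$, which is not taken from this section; all function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def multiply(p, q):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    r = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            r[i + j] += a * b
    return r

def power(p, k):
    """k-th power of the polynomial p."""
    r = [Fraction(1)]
    for _ in range(k):
        r = multiply(r, p)
    return r

def jacobi(n, a, b):
    """Coefficients of P_n^{(a,b)} for integers a, b >= 0 (explicit-sum formula)."""
    total = [Fraction(0)] * (n + 1)
    for s in range(n + 1):
        term = multiply(power([Fraction(-1, 2), Fraction(1, 2)], s),
                        power([Fraction(1, 2), Fraction(1, 2)], n - s))
        weight = comb(n + a, n - s) * comb(n + b, s)
        total = [t + weight * c for t, c in zip(total, term)]
    return total

def integrate(p):
    """Exact integral of the polynomial p over [-1, 1] (odd powers contribute 0)."""
    return sum(2 * c / (k + 1) for k, c in enumerate(p) if k % 2 == 0)

def jacobi_inner(n, m, a, b):
    """Integral over [-1, 1] of P_n P_m (1 - x)^a (1 + x)^b."""
    rho = multiply(power([Fraction(1), Fraction(-1)], a),
                   power([Fraction(1), Fraction(1)], b))
    return integrate(multiply(multiply(jacobi(n, a, b), jacobi(m, a, b)), rho))

def jacobi_norm(n, a, b):
    """Right-hand side of Theorem 4.6 for n = m, with factorials for the gammas."""
    return Fraction(2 ** (a + b + 1) * factorial(n + a) * factorial(n + b),
                    factorial(n) * (a + b + 2 * n + 1) * factorial(n + a + b))

for a in range(3):
    for b in range(3):
        for n in range(5):
            for m in range(5):
                want = jacobi_norm(n, a, b) if n == m else 0
                assert jacobi_inner(n, m, a, b) == want
```

Taking `a = b = 0` in `jacobi_inner` reproduces the Legendre case of Corollary 4.6.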

4.7.3 Orthogonality for Associated Legendre Functions. By definition 
(4.2.11) 






p{x)Pn"'\x) dx 



-1 



^ ^ [‘ 
2« nl J_i 



^/1+m 






(-1)” 

V ^ ^ / 1 xB+m 



2" n! 



(- 1 )" 



(4.7.32) 



„(«+m)(^)(l _ ^2)B 



for any polynomial p where the latter statement of equality follows by integration 
by parts, as in preceding arguments. In fact, here we can take p to be any 
function since we only need that 



p(x) — (l-x^)” 



= 0 . 



(4.7.33) 



Choose $p(x) = P_n^{(m)}(x)(1 - x^2)^m$ for $m \le n$:

$$p^{(n+m)}(x) = \sum_{j=0}^{n+m}\binom{n+m}{j}P_n^{(m+j)}(x)\,\frac{d^{n+m-j}}{dx^{n+m-j}}(1-x^2)^m.$$ (4.7.34)

Since $P_n^{(m+j)} = 0$ for $m + j > n$ = degree of $P_n$ and similarly

$$\frac{d^{n+m-j}}{dx^{n+m-j}}(1-x^2)^m = 0$$ (4.7.35)

for $n + m - j > 2m$, we obtain a non-zero summand in (4.7.34) only for $m + j \le n$
and $n + m - j \le 2m$, i.e., only for $n = m + j$:

$$p^{(n+m)}(x) = \binom{n+m}{n-m}P_n^{(n)}(x)\,\frac{d^{2m}}{dx^{2m}}(1-x^2)^m.$$ (4.7.36)

Now $P_n = P_n^{(0,0)}$ $\Rightarrow$ $P_n^{(n)}(x) = (n+1)(n+2)\cdots(n+n)/2^n$
by (4.7.29), and

$$(1-x^2)^m = \sum_{j=0}^{m}\binom{m}{j}(-1)^jx^{2j}$$ (4.7.37)

where $\frac{d^{2m}}{dx^{2m}}x^{2j} = 0$ unless $2m \le 2j$. That is, only $j = m$ contributes to the sum
in (4.7.37):

$$\frac{d^{2m}}{dx^{2m}}(1-x^2)^m = (-1)^m\frac{d^{2m}}{dx^{2m}}x^{2m} = (-1)^m(2m)!$$ (4.7.38)

$$\Rightarrow\quad p^{(n+m)}(x) = \binom{n+m}{n-m}\frac{(n+1)(n+2)\cdots(n+n)}{2^n}(-1)^m(2m)! = \frac{(n+m)!}{(n-m)!}\,\frac{(n+1)(n+2)\cdots(n+n)(-1)^m}{2^n}$$ (4.7.39)

by (4.7.36). Also by the change of variables $x = \cos\phi$

$$\int_{-1}^{1}(1-x^2)^n\,dx = 2\int_0^1(1-x^2)^n\,dx = 2\int_0^{\pi/2}\sin^{2n+1}\phi\,d\phi = B\big(n+1, \tfrac{1}{2}\big) = \frac{2\cdot 2\cdot 4\cdot 6\cdots 2n}{1\cdot 3\cdot 5\cdot 7\cdots(2n+1)} = \frac{2^{n+1}\,n!}{1\cdot 3\cdot 5\cdots(2n+1)}.$$ (4.7.40)

Equations (4.7.32), (4.7.39), (4.7.40) therefore give

$$\int_{-1}^{1}P_n^{(m)}(x)^2(1-x^2)^m\,dx = \frac{2(n+m)!}{(n-m)!}\,\frac{(n+1)(n+2)\cdots(n+n)}{2^n\cdot 1\cdot 3\cdot 5\cdots(2n-1)(2n+1)} = \frac{2(n+m)!}{(n-m)!\,(2n+1)}$$ (4.7.41)

since $(1\cdot 3\cdot 5\cdots(2n-1))\cdot 2\cdot 4\cdot 6\cdots 2n = (2n)! = n!\,(n+1)(n+2)\cdots(n+n)$ $\Rightarrow$

$$\frac{(n+1)(n+2)\cdots(n+n)}{2^n\cdot 1\cdot 3\cdot 5\cdots(2n-1)} = \frac{2\cdot 4\cdot 6\cdots 2n}{2^n\,n!} = 1.$$ (4.7.42)

If we now choose $p(x) = P_l^{(m)}(x)(1 - x^2)^m$, then for $l < n$, $p^{(n+m)} = 0$, since
$n + m > l + m = (l - m) + 2m$ = degree of $p$. This means that for $l < n$

$$\int_{-1}^{1}P_l^{(m)}(x)P_n^{(m)}(x)(1-x^2)^m\,dx = 0$$ (4.7.43)

by (4.7.32). Similarly if $l > n$, replace $P_n^{(m)}(x)$ by $P_l^{(m)}(x)$ in (4.7.32) and choose
$p(x) = P_n^{(m)}(x)(1 - x^2)^m$ to conclude that (4.7.43) holds for $l > n$ also, which
with equation (4.7.41) allows the conclusion

Theorem 4.7 (Legendre Orthogonality).

$$\int_{-1}^{1}P_n^{(m)}(x)P_l^{(m)}(x)(1-x^2)^m\,dx = \delta_{ln}\,\frac{2}{2n+1}\,\frac{(n+m)!}{(n-m)!}$$

for $m \le n$. For $m > n$ the integral is zero since then $P_n^{(m)} = 0$. Here $n, l, m = 0, 1, 2, 3, \ldots$. In particular for $m = 0$ we obtain a second proof of Corollary 4.6.

The associated Legendre functions $P_n^m$ defined on $(-1, 1)$ extend to continuous
functions on $[-1, 1]$. Namely, for $m > 0$

$$\lim_{x \to \pm 1}(1 - x^2)^{m/2} = 0.$$ (4.7.44)

Since $P_n^m(x) \overset{\text{def}}{=} (1 - x^2)^{m/2}P_n^{(m)}(x)$ we set

$$P_n^m(\pm 1) \overset{\text{def}}{=} \begin{cases} 0 & \text{for } m = 1, 2, 3, \ldots \\ P_n(\pm 1) & \text{for } m = 0. \end{cases}$$ (4.7.45)

On $(-1, 1)$, $P_n^m(x)P_l^m(x) = (1 - x^2)^mP_n^{(m)}(x)P_l^{(m)}(x)$, which also holds on $[-1, 1]$
by (4.7.45), which means that Theorem 4.7 can be expressed as follows.

Theorem 4.8 (Orthogonality for associated Legendre functions). For integers
$n, l, m \ge 0$ with $m \le n$

$$\int_{-1}^{1}P_l^m(x)P_n^m(x)\,dx = \delta_{ln}\,\frac{2}{2n+1}\,\frac{(n+m)!}{(n-m)!}.$$ (4.7.46)

Here we may extend $P_n^m$ to a continuous function on $[-1, 1]$ by definition (4.7.45).
Equation (4.7.46) may be written, for $m \le n$,

$$\int_0^\pi P_l^m(\cos\phi)P_n^m(\cos\phi)\sin\phi\,d\phi = \delta_{ln}\,\frac{2}{2n+1}\,\frac{(n+m)!}{(n-m)!}$$ (4.7.47)

by the change of variables $x = \cos\phi$. For $m > n$, $P_n^m = 0$.
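Since $P_l^m(x)P_n^m(x) = (1-x^2)^mP_l^{(m)}(x)P_n^{(m)}(x)$ is a polynomial, Theorem 4.8 can be checked exactly. The sketch below is illustrative only: it builds $P_n$ from the standard three-term recurrence $(k+1)P_{k+1} = (2k+1)xP_k - kP_{k-1}$ (an assumption of the sketch, not a formula quoted from this section), and the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial

def legendre(n):
    """Coefficients (lowest degree first) of the Legendre polynomial P_n."""
    p_prev, p = [Fraction(1)], [Fraction(0), Fraction(1)]
    if n == 0:
        return p_prev
    for k in range(1, n):
        x_p = [Fraction(0)] + p                                  # x * P_k
        padded = p_prev + [Fraction(0)] * (len(x_p) - len(p_prev))
        p_prev, p = p, [((2 * k + 1) * u - k * v) / (k + 1)
                        for u, v in zip(x_p, padded)]
    return p

def derivative(p, times=1):
    """Derivative of order `times`; the zero polynomial is returned as [0]."""
    for _ in range(times):
        p = [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]
    return p

def multiply(p, q):
    """Product of two coefficient lists."""
    r = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            r[i + j] += a * b
    return r

def integrate(p):
    """Exact integral of the polynomial p over [-1, 1]."""
    return sum(2 * c / (k + 1) for k, c in enumerate(p) if k % 2 == 0)

def associated_inner(l, n, m):
    """Integral over [-1, 1] of P_l^m(x) P_n^m(x), written as
    P_l^(m) P_n^(m) (1 - x^2)^m so that the integrand is a polynomial."""
    weight = [Fraction(1)]
    for _ in range(m):
        weight = multiply(weight, [Fraction(1), Fraction(0), Fraction(-1)])
    product = multiply(derivative(legendre(l), m), derivative(legendre(n), m))
    return integrate(multiply(product, weight))

for n in range(5):
    for l in range(5):
        for m in range(n + 1):                                   # Theorem 4.8 assumes m <= n
            want = (Fraction(2 * factorial(n + m), (2 * n + 1) * factorial(n - m))
                    if l == n else 0)
            assert associated_inner(l, n, m) == want
```

The case `m = 0` is again Corollary 4.6.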
4.7.4 Hermite Orthogonality. As another example consider the Hermite case
where we take $U = \mathbb{R}$: $\sigma(x) = 1$, $\tau(x) = -2x$, $\rho(x) = \exp(-x^2)$, $\lambda_n = 2n$,
$y_n = (-1)^nH_n$. By Corollary 4.1

$$(\sigma\rho)(x)(-1)^nH_n'(x) = -2n\frac{d^{n-1}}{dx^{n-1}}\exp(-x^2) = (-2n)(-1)^{n-1}\exp(-x^2)H_{n-1}(x)$$

for $n \ge 1$; i.e., $H_n'(x) = 2nH_{n-1}(x)$ so that by Corollary 4.2 the following recursion formula holds.

Proposition 4.11. For $n \ge 1$, $x \in \mathbb{R}$,

$$H_{n+1}(x) - 2xH_n(x) + 2nH_{n-1}(x) = 0;$$

also $H_n'(x) = 2nH_{n-1}(x)$. Hence $H_n^{(k)}(x) = 2^kn(n-1)(n-2)\cdots(n-k+1)H_{n-k}(x)$ for $1 \le k \le n$.

For any $a > 0$

$$2(n-m)\int_{-a}^{a}\exp(-x^2)H_n(x)H_m(x)\,dx = \Big[\exp(-x^2)\,W(H_n, H_m)(x)\Big]_{-a}^{a}$$ (4.7.49)

because of equations (4.7.1), (4.7.3). As $W(H_n, H_m)(x)$ is a polynomial in $x$ we
can let $a \to \infty$ and conclude

Proposition 4.12.

$$\int_{-\infty}^{\infty}H_n(x)H_m(x)\exp(-x^2)\,dx = 0$$

for $n \ne m$.

If $p$ is any polynomial

$$\int_{-\infty}^{\infty}p(x)H_n(x)\exp(-x^2)\,dx = (-1)^n\int_{-\infty}^{\infty}p(x)\frac{d^n}{dx^n}\exp(-x^2)\,dx = \int_{-\infty}^{\infty}p^{(n)}(x)\exp(-x^2)\,dx$$ (4.7.50)

again by induction on $n$, as in equation (4.7.10), where we integrate by parts and
use that

$$\Big[p(x)\frac{d^n}{dx^n}\exp(-x^2)\Big]_{-\infty}^{\infty} = 0$$ (4.7.51)

since

$$\frac{d^n}{dx^n}\exp(-x^2) = (-1)^n\exp(-x^2)H_n(x).$$ (4.7.52)

Now $H_n^{(n)}(x) = 2^nn!\,H_0(x) = 2^nn!$ by Proposition 4.11. The choice $p = H_n$
therefore provides, in conjunction with Proposition 4.12 and the fact that
$\int_{-\infty}^{\infty}\exp(-x^2)\,dx = \sqrt{\pi}$,

Theorem 4.9 (Hermite Orthogonality).

$$\int_{-\infty}^{\infty}H_n(x)H_m(x)\exp(-x^2)\,dx = 2^n\,n!\,\sqrt{\pi}\,\delta_{nm}.$$



Later we shall also use

Theorem 4.10. For $n = 0, 1, 2, 3, \ldots$,

$$\int_{-\infty}^{\infty}x^2H_n(x)^2e^{-x^2}\,dx = 2^{n-1}n!\,(2n+1)\sqrt{\pi}.$$ (4.7.53)

Proof. For $q(x) = x^2$ choose $p = qH_n$ in (4.7.50):

$$\int_{-\infty}^{\infty}x^2H_n^2(x)e^{-x^2}\,dx = \int_{-\infty}^{\infty}p^{(n)}(x)e^{-x^2}\,dx.$$ (4.7.54)

By Proposition 4.11

$$H_n^{(n-1)}(x) = 2^{n-1}n!\,H_1(x) = 2^{n-1}n!\,2x = 2^nn!\,x,$$

$$H_n^{(n-2)}(x) = 2^{n-2}\frac{n!}{2}H_2(x) = 2^{n-2}\frac{n!}{2}(4x^2 - 2) = 2^{n-1}n!\,x^2 - 2^{n-2}n!$$ (4.7.55)

for $n \ge 3$. These formulas also hold for $n = 2$, by direct calculation, again as
$H_2(x) = 4x^2 - 2$. By equation (L2) in Section 4.4,

$$p^{(n)}(x) = x^2H_n^{(n)}(x) + 2nxH_n^{(n-1)}(x) + 2\binom{n}{2}H_n^{(n-2)}(x).$$ (4.7.56)

Having noted also that $H_n^{(n)}(x) = 2^nn!$, we see that $p^{(n)}(x) = 2^nn!\,x^2 + 2nx\,2^nn!\,x + n(n-1)(2^{n-1}n!\,x^2 - 2^{n-2}n!) = 2^{n-1}n!\,(n+1)(n+2)x^2 - n(n-1)2^{n-2}n!$.
Since

$$\int_0^\infty e^{-x^2}\,dx = \frac{\sqrt{\pi}}{2}, \qquad \int_0^\infty x^2e^{-x^2}\,dx = \frac{\sqrt{\pi}}{4}$$ (4.7.57)

the right-hand side of equation (4.7.54) becomes (for $n \ge 2$)

$$2\cdot 2^{n-1}n!\,(n+1)(n+2)\frac{\sqrt{\pi}}{4} - 2n(n-1)2^{n-2}n!\,\frac{\sqrt{\pi}}{2} = 2^{n-1}n!\,(2n+1)\sqrt{\pi},$$ (4.7.58)

which proves the theorem for $n \ge 2$. For $n = 1$, we use $H_1(x)^2 = 4x^2$, and that

$$\int_0^\infty x^4e^{-x^2}\,dx = \frac{3}{8}\sqrt{\pi}$$ (4.7.59)

to check the theorem directly. For $n = 0$, the theorem states that

$$\int_{-\infty}^{\infty}x^2e^{-x^2}\,dx = \frac{\sqrt{\pi}}{2},$$

which is true by (4.7.57). Note for the record that the generalization of equations (4.7.57), (4.7.59) is

$$\int_0^\infty x^{2n}e^{-ax^2}\,dx = \frac{1\cdot 3\cdot 5\cdots(2n-1)}{2^{n+1}a^n}\sqrt{\frac{\pi}{a}}$$ (4.7.60)

for $a > 0$, $n = 1, 2, 3, \ldots$, which follows by properties of the gamma function. □
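Theorems 4.9 and 4.10 lend themselves to an exact check, because every integral involved is a rational multiple of $\sqrt{\pi}$. The sketch below generates $H_n$ from the recursion of Proposition 4.11 and uses the Gaussian moments $\int_{-\infty}^{\infty}x^{2j}e^{-x^2}\,dx = \sqrt{\pi}\,(2j)!/(4^jj!)$, which is (4.7.60) with $a = 1$, doubled. The function names are hypothetical.

```python
from fractions import Fraction
from math import factorial

def hermite(n):
    """Integer coefficients (lowest degree first) of H_n, from
    H_{k+1} = 2x H_k - 2k H_{k-1} with H_0 = 1 and H_1 = 2x."""
    h_prev, h = [1], [0, 2]
    if n == 0:
        return h_prev
    for k in range(1, n):
        two_x_h = [0] + [2 * c for c in h]
        padded = h_prev + [0] * (len(two_x_h) - len(h_prev))
        h_prev, h = h, [u - 2 * k * v for u, v in zip(two_x_h, padded)]
    return h

def gaussian_moment(k):
    """Integral over R of x**k * exp(-x**2), divided by sqrt(pi)."""
    if k % 2:
        return Fraction(0)
    j = k // 2
    return Fraction(factorial(2 * j), 4 ** j * factorial(j))

def weighted_integral(n, m, power=0):
    """Integral over R of x**power * H_n * H_m * exp(-x**2), divided by sqrt(pi)."""
    return sum(a * b * gaussian_moment(i + j + power)
               for i, a in enumerate(hermite(n))
               for j, b in enumerate(hermite(m)))

for n in range(7):
    for m in range(7):
        # Theorem 4.9
        want = 2 ** n * factorial(n) if n == m else 0
        assert weighted_integral(n, m) == want
    # Theorem 4.10 (the factor 2**(n-1) is 1/2 when n = 0)
    assert weighted_integral(n, n, power=2) == Fraction(
        2 ** n * factorial(n) * (2 * n + 1), 2)
```

Working with the coefficient of $\sqrt{\pi}$ avoids floating point entirely, so the equalities are tested exactly.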



4.8 Normalized Wave Functions for the 
Harmonic Oscillator 



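As an example of the reduction to canonical form technique we obtained earlier
the quantization of energy

$$E = E_n = 2\pi\left(n + \tfrac{1}{2}\right)\nu\hbar, \quad n = 0, 1, 2, 3, \ldots$$ (4.8.1)

for the harmonic oscillator with frequency $\nu$, and the corresponding solutions
(wave functions) $\psi = \psi_n^0$ of the time-independent Schrödinger equation

$$\frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2}\big(E - 2\pi^2\nu^2mx^2\big)\psi = 0,$$ (4.8.2)

having noted that the quantization condition (4.8.1) is nothing more (nor less)
than the general mathematical condition $\lambda = \lambda_n \overset{\text{def}}{=} -n\tau' - \frac{n(n-1)}{2}\sigma''$ of Theorem 4.2. $\psi_n^0$ was given on $\mathbb{R}$ by (4.5.8):

$$\psi_n^0(x) = (-1)^n(\sqrt{b})^n\exp\left(-\frac{bx^2}{2}\right)H_n(\sqrt{b}\,x),$$ (4.8.3)

for $b \overset{\text{def}}{=} 2\pi m\nu/\hbar$, where earlier we wrote $\psi_n$ instead of $\psi_n^0$. We compute:

$$\int_{\mathbb{R}}\psi_n^0(x)\psi_m^0(x)\,dx = (-1)^{n+m}(\sqrt{b})^{n+m}\int_{\mathbb{R}}e^{-bx^2}H_n(\sqrt{b}\,x)H_m(\sqrt{b}\,x)\,dx$$

$$= (-1)^{n+m}\frac{(\sqrt{b})^{n+m}}{\sqrt{b}}\int_{\mathbb{R}}e^{-t^2}H_n(t)H_m(t)\,dt = \frac{b^n}{\sqrt{b}}\,2^nn!\,\sqrt{\pi}\,\delta_{nm}$$ (4.8.4)

by Hermite orthogonality (Theorem 4.9). Therefore we obtain normalized solutions $\psi_n$ of (4.8.2) by setting

$$\psi_n(x) \overset{\text{def}}{=} \psi_n^0(x)\left[\frac{b^n2^nn!\,\sqrt{\pi}}{\sqrt{b}}\right]^{-1/2} = (-1)^n\left[2^nn!\,\sqrt{\frac{\pi}{b}}\right]^{-1/2}\exp\left(-\frac{bx^2}{2}\right)H_n(\sqrt{b}\,x):$$ (4.8.5)

$$\int_{\mathbb{R}}\psi_n(x)\psi_m(x)\,dx = \delta_{nm}$$ (4.8.6)

for $n, m = 0, 1, 2, 3, \ldots$. Note that by the definition $H_n(x) = (-1)^ne^{x^2}f^{(n)}(x)$ for $f(x) = e^{-x^2}$
we have $H_n(-x) = (-1)^nH_n(x)$ (since $f^{(j)}(-x) = (-1)^jf^{(j)}(x)$ for $j = 0, 1, 2, 3, \ldots$) and hence the wave functions $\psi_n$ satisfy the parity condition

$$\psi_n(-x) = (-1)^n\psi_n(x)$$ (4.8.7)

on $\mathbb{R}$.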

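For $c_n \overset{\text{def}}{=} \left[2^nn!\,\sqrt{\pi/b}\right]^{-1/2}$, so that $\psi_n(x)^2 = c_n^2\,e^{-bx^2}H_n^2(\sqrt{b}\,x)$ by (4.8.5), we apply Theorem 4.10 to compute

$$\int_{\mathbb{R}}x^2\psi_n^2(x)\,dx = c_n^2\int_{\mathbb{R}}x^2e^{-bx^2}H_n^2(\sqrt{b}\,x)\,dx = \frac{c_n^2}{\sqrt{b}}\int_{\mathbb{R}}\frac{t^2}{b}\,e^{-t^2}H_n^2(t)\,dt = \frac{c_n^2}{b\sqrt{b}}\,2^{n-1}n!\,(2n+1)\sqrt{\pi} = \frac{2n+1}{2b}.$$ (4.8.8)

That is,

Theorem 4.11. The normalized wave functions $\psi_n$ in (4.8.5) satisfy

$$\int_{\mathbb{R}}x^2\psi_n(x)^2\,dx = \frac{2n+1}{2b} = \frac{E_n}{2\pi\nu\hbar b} = \frac{E_n}{4\pi^2\nu^2m} = \frac{E_n}{m\omega^2},$$

where $E_n = 2\pi(n + \frac{1}{2})\nu\hbar$ is the $n$th energy level, and $\omega = 2\pi\nu$ is the angular
speed of the oscillator.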
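As a complement to the exact argument, the normalization (4.8.6) and the value $(2n+1)/(2b)$ in Theorem 4.11 can be confirmed by direct numerical quadrature. The sketch below uses a composite Simpson rule on a truncated interval; the value `b = 1.7`, the cutoff $12/\sqrt{b}$, and the number of steps are arbitrary choices for the illustration, and the function names are hypothetical.

```python
import math

def hermite_value(n, x):
    """H_n(x) from the recursion H_{k+1} = 2x H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h

def psi(n, x, b):
    """Normalized oscillator wave function (4.8.5)."""
    norm = (2 ** n * math.factorial(n) * math.sqrt(math.pi / b)) ** -0.5
    return ((-1) ** n * norm * math.exp(-b * x * x / 2)
            * hermite_value(n, math.sqrt(b) * x))

def simpson(f, lo, hi, steps=4000):
    """Composite Simpson rule; `steps` must be even."""
    h = (hi - lo) / steps
    total = f(lo) + f(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def mean_square_position(n, b):
    """Numerical value of the integral of x^2 psi_n(x)^2 over R."""
    cutoff = 12 / math.sqrt(b)   # exp(-b x^2) is negligible beyond this point
    return simpson(lambda x: x * x * psi(n, x, b) ** 2, -cutoff, cutoff)

b = 1.7
for n in range(5):
    cutoff = 12 / math.sqrt(b)
    norm = simpson(lambda x: psi(n, x, b) ** 2, -cutoff, cutoff)
    assert abs(norm - 1.0) < 1e-9                                   # (4.8.6) with n = m
    assert abs(mean_square_position(n, b) - (2 * n + 1) / (2 * b)) < 1e-9
```

In physical terms $b = m\omega/\hbar$, so the checked value is $\hbar(2n+1)/(2m\omega) = E_n/(m\omega^2)$.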

4.9 Further Examples of the Reduction Technique: 

The Poschl-Teller Potential 

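The following differential equation will arise in the next chapter:

$$u''(x) - \frac{2x}{1-x^2}\,u'(x) - \left[\frac{a}{1-x^2} + \frac{m^2}{(1-x^2)^2}\right]u(x) = 0$$ (4.9.1)

on $(-1, 1)$ where $a \in \mathbb{R}$, $m \in \mathbb{Z}$. Here $\sigma(x) = 1 - x^2$, $\tilde{\tau}(x) = -2x$, $\tilde{\sigma}(x) = -a(1 - x^2) - m^2$.

In this case, from (4.3.4)

$$f_\kappa(x) \overset{\text{def}}{=} \left(\frac{\sigma' - \tilde{\tau}}{2}\right)^2(x) + \kappa\sigma(x) - \tilde{\sigma}(x) = \kappa(1 - x^2) + a(1 - x^2) + m^2 = -(a + \kappa)x^2 + a + \kappa + m^2$$ (4.9.2)

$\Rightarrow$ $\Delta(\kappa) = 4(a + \kappa)(a + \kappa + m^2)$.

$\Delta(\kappa) = 0$ $\Rightarrow$ $a + \kappa = 0$ or $a + \kappa + m^2 = 0$ $\Rightarrow$

$$f_\kappa(x) = \begin{cases} m^2 & \text{if } a + \kappa = 0, \\ m^2x^2 & \text{if } a + \kappa + m^2 = 0 \end{cases} \quad\Rightarrow\quad p(x) \overset{\text{def}}{=} \begin{cases} |m| & \text{if } a + \kappa = 0, \\ |m|x & \text{if } a + \kappa + m^2 = 0; \end{cases}$$

i.e., $p^2 = f_\kappa$.

By our general theory we have the definitions (4.3.5), (4.3.6):

$$\pi^{\pm}(x) \overset{\text{def}}{=} \frac{\sigma' - \tilde{\tau}}{2}(x) \pm p(x) = \pm p(x),$$ (4.9.3)

$$\tau(x) \overset{\text{def}}{=} \tilde{\tau}(x) + 2\pi(x) = -2x \pm 2p(x).$$ (4.9.4)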



We choose $\tau$ such that $\tau' < 0$ and $\tau(x) = 0$ for some $x$ in $(-1, 1)$; cf. Lemma 4.1.
This can be achieved by taking $a + \kappa + m^2 = 0$ (i.e., $\kappa = -(a + m^2)$) and $\pi = \pi^-$
(that is, $\pi(x) = -|m|x$); i.e., $\tau(x) = -2x - 2p(x) = -2x - 2|m|x = -2(|m| + 1)x$
(which vanishes at 0 in $(-1, 1)$). Then

$$\lambda \overset{\text{def}}{=} \kappa + \pi' = -(a + m^2) - |m| = -a - |m|(|m| + 1), \quad \text{and} \quad \lambda_n \overset{\text{def}}{=} -n\tau' - \frac{n(n-1)}{2}\sigma'' = n(2|m| + n + 1).$$ (4.9.5)

The differential equation for $u$ is thus reduced to the canonical form

$$(1 - x^2)y''(x) - 2(|m| + 1)xy'(x) + [-a - |m|(|m| + 1)]y(x) = 0$$ (4.9.6)

on $(-1, 1)$, which is Jacobi's differential equation (4.2.10)

$$(1 - x^2)y''(x) + [-(\alpha + \beta + 2)x + \beta - \alpha]\,y'(x) + \mu y(x) = 0$$

for $\mu = -a - |m|(|m| + 1)$, $\alpha = \beta = |m|$, and which thus has $P_n^{(|m|,|m|)}$ as a solution
provided $n(\alpha + \beta + n + 1) = \mu$, i.e., provided $\lambda_n = \lambda$ as usual, as shown in
Section 4.6. Now

$$\lambda_n = \lambda \iff -a = |m|(|m| + 1) + n(2|m| + n + 1) = (|m| + n)(|m| + n + 1).$$

Thus if $\lambda_n = \lambda$, then $-a = l(l + 1)$ where $l \ge 0$ is an integer ($= |m| + n$) $\ge |m|$.
Conversely, given an integer $l \ge |m|$, take $n = l - |m| \ge 0$ (an integer), $-a = l(l + 1)$ and thus conclude that $l(l + 1) - |m|(|m| + 1) = n(2|m| + n + 1)$; i.e.,
$\lambda = \lambda_n$ $\Rightarrow$ $P_{l-|m|}^{(|m|,|m|)}$ is a solution of equation (4.9.6), with $-a = l(l + 1)$ there.

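A choice of $\phi$ in Proposition 4.4 such that $\phi' = \phi\pi/\sigma$ on $(-1, 1)$ is given by

$$\phi(x) = \exp\left(\int\frac{\pi}{\sigma}(x)\,dx\right) = \exp\left(\int\frac{-|m|x}{1 - x^2}\,dx\right) = (1 - x^2)^{|m|/2}.$$

Also let $u_l(x) \overset{\text{def}}{=} \phi(x)P_{l-|m|}^{(|m|,|m|)}(x)$ on $(-1, 1)$ and conclude by general principles
(Theorem 4.1) that $u_l$ is a solution of our original equation (4.9.1), for $-a = l(l + 1)$ there. On the other hand, by Proposition 4.9

$$P_l^{(q)}(x) = \frac{(l+q)!}{l!\,2^q}\,P_{l-q}^{(q,q)}(x)$$ (4.9.7)

for $q \ge 0$ an integer with $l \ge q$. Therefore

$$u_l(x) = (1 - x^2)^{|m|/2}\,\frac{l!\,2^{|m|}}{(l + |m|)!}\,P_l^{(|m|)}(x) = \frac{l!\,2^{|m|}}{(l + |m|)!}\,P_l^{|m|}(x)$$ (4.9.8)

where $P_l^{|m|}$ is the associated Legendre function on $(-1, 1)$; see definition (4.2.15).
In particular

Corollary 4.7. $P_l^{|m|}$ is a solution of

$$u''(x) - \frac{2x}{1-x^2}\,u'(x) + \left[\frac{l(l+1)}{1-x^2} - \frac{m^2}{(1-x^2)^2}\right]u(x) = 0$$ (4.9.9)

on $(-1, 1)$ for integers $l$, $m$ with $l \ge |m|$.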

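In the next example we apply the reduction technique to obtain solutions of the
Schrödinger equation

$$\psi''(x) + \frac{2m}{\hbar^2}\big(E - V(x)\big)\psi(x) = 0$$ (4.9.10)

in case $V$ is the Pöschl-Teller potential:

$$V(x) \overset{\text{def}}{=} \frac{-V_0}{\cosh^2 ax}, \quad V_0 > 0,\ a \in \mathbb{R} - \{0\};$$ (4.9.11)

thus $V(x) < 0$ on $\mathbb{R}$.

Given the structure of $V$ it is convenient to make a change of variables $x \to v = \tanh ax$. That is, define the function $f$ by

$$f(v) \overset{\text{def}}{=} \frac{1}{a}\tanh^{-1}v = \frac{1}{2a}\log\frac{1+v}{1-v} \quad \text{on } (-1, 1); \qquad f'(v) = \frac{1}{a(1-v^2)}, \quad f''(v) = \frac{2v}{a(1-v^2)^2}.$$ (4.9.12)

Given a function $\psi$ on $\mathbb{R}$ (for example $\psi$ a solution of (4.9.10)) set

$$\Phi(v) \overset{\text{def}}{=} \psi(f(v)) \quad \text{on } (-1, 1).$$ (4.9.13)

Then

$$\Phi'(v) = \psi'(f(v))f'(v), \qquad \Phi''(v) = \psi'(f(v))f''(v) + \psi''(f(v))f'(v)^2$$

$$\Rightarrow\ \psi'(f(v)) = \Phi'(v)/f'(v)\ \Rightarrow\ \psi''(f(v))f'(v)^2 = \Phi''(v) - \frac{\Phi'(v)}{f'(v)}f''(v)$$

$$\Rightarrow\ \psi''(f(v)) = \frac{\Phi''(v)}{f'(v)^2} - \frac{\Phi'(v)f''(v)}{f'(v)^3} = a^2(1 - v^2)^2\Phi''(v) - 2a^2v(1 - v^2)\Phi'(v),$$ (4.9.14)

by (4.9.12). Also

$$\cosh^2(\tanh^{-1}v) = \left[\frac{\exp\left(\frac{1}{2}\log\frac{1+v}{1-v}\right) + \exp\left(-\frac{1}{2}\log\frac{1+v}{1-v}\right)}{2}\right]^2 = \frac{1}{1 - v^2}$$ (4.9.15)

(where one squares the bracket and simplifies) which means that

$$V(f(v)) = \frac{-V_0}{\cosh^2(\tanh^{-1}v)} = -V_0(1 - v^2).$$ (4.9.16)

Assume that $\psi$ is a solution of (4.9.10); take $x = f(v)$ there to get by (4.9.13),
(4.9.14), (4.9.16)

$$a^2(1 - v^2)^2\Phi''(v) - 2a^2v(1 - v^2)\Phi'(v) + \frac{2m}{\hbar^2}\big[E + V_0(1 - v^2)\big]\Phi(v) = 0.$$ (4.9.17)

That is, for

$$\sigma(v) \overset{\text{def}}{=} 1 - v^2, \quad \tilde{\tau}(v) \overset{\text{def}}{=} -2v, \quad \tilde{\sigma}(v) \overset{\text{def}}{=} \frac{2m}{a^2\hbar^2}\big[E + V_0(1 - v^2)\big],$$ (4.9.18)

equation (4.9.17) reads

$$\sigma(v)^2\Phi''(v) + \sigma(v)\tilde{\tau}(v)\Phi'(v) + \tilde{\sigma}(v)\Phi(v) = 0$$ (4.9.19)

on $(-1, 1)$, which therefore is an equation of hypergeometric type, by (4.2.1). As
in the example of the finite potential well in Section 3.3 of Chapter 3, and in the
case of a hydrogenic atom considered in the next chapter, we look for bound states
$\Phi$, that is, solutions of (4.9.19) for $E < 0$. Thus $-2mE/a^2\hbar^2 > 0$ and we set

$$\beta \overset{\text{def}}{=} \sqrt{\frac{-2mE}{a^2\hbar^2}} > 0, \qquad \gamma^2 \overset{\text{def}}{=} \frac{2mV_0}{a^2\hbar^2} > 0.$$ (4.9.20)

$\tilde{\sigma}(v)$ in (4.9.18) can be expressed in terms of $\beta$, $\gamma$:

$$\tilde{\sigma}(v) = -\beta^2 + \gamma^2(1 - v^2).$$ (4.9.21)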



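We proceed to reduce equation (4.9.19) to canonical form. For $\kappa \in \mathbb{C}$, $v \in (-1, 1)$, $f_\kappa(v)$ in (4.3.4) is given by

$$f_\kappa(v) = (\gamma^2 - \kappa)v^2 + \beta^2 + \kappa - \gamma^2,$$ (4.9.22)

in view of (4.9.18), (4.9.21). The discriminant $\Delta(\kappa)$ is $-4(\beta^2 + \kappa - \gamma^2)(\gamma^2 - \kappa)$,
which vanishes for $\kappa = \gamma^2$, $\gamma^2 - \beta^2$. If $\kappa \ne \gamma^2$ and $\Delta(\kappa) = 0$ then $\kappa = \gamma^2 - \beta^2$ $\Rightarrow$ $f_\kappa(v) = \beta^2v^2$ in (4.9.22) $\Rightarrow$ $f_\kappa$ has a polynomial square root $p$
given by

$$p(v) = \beta v.$$ (4.9.23)

In case $\kappa = \gamma^2$ we have $\Delta(\kappa) = 0$, $f_\kappa(v) = \beta^2$ is a constant, and $p(v) \overset{\text{def}}{=} \beta$ $\Rightarrow$ $p$
is a square root of $f_\kappa$. $\pi^{\pm}(v)$ in (4.3.5) is given by $\pi^{\pm}(v) = \pm p(v)$, and $\tau(v)$
in (4.3.6) is given by $\tau(v) = \tilde{\tau}(v) + 2\pi(v)$ for one of the choices $\pi = \pi^{\pm}$. We
assume $\kappa \ne \gamma^2$ (so that $p$ is given by (4.9.23)) and we make the choice $\pi = \pi^-$,
as this ensures that $\tau$ satisfies Lemma 4.1:

$$\tau(v) = -2(1 + \beta)v$$ (4.9.24)

$\Rightarrow$ $\tau'(v) = -2(1 + \beta) < 0$ and $\tau(v_0) = 0$ for some $v_0 \in (-1, 1)$, namely $v_0 = 0$.
Thus

$$\pi(v) = -\beta v \quad \text{and} \quad \lambda \overset{\text{def}}{=} \kappa + \pi' = \gamma^2 - \beta^2 - \beta$$ (4.9.25)

in (4.3.6), and we see that a canonical form of (4.9.19) is

$$\sigma y'' + \tau y' + \lambda y = 0$$ (4.9.26)

for $\sigma$, $\tau$, $\lambda$ given by

$$\sigma(v) = 1 - v^2, \quad \tau(v) = -2(1 + \beta)v, \quad \lambda = \gamma^2 - \beta^2 - \beta$$ (4.9.27)

(by (4.9.18), (4.9.24), (4.9.25)). Also in (4.3.8)

$$\phi(v) = (1 - v^2)^{\beta/2}.$$ (4.9.28)

By Theorem 4.1, $y$ solves (4.9.26) $\iff$ $\Phi$ solves (4.9.19) provided $\Phi$, $y$ are
related by

$$\Phi = \phi y.$$ (4.9.29)

A solution $h$ of

$$h'(v) = \frac{\tau(v) - \sigma'(v)}{\sigma(v)} = \frac{-2\beta v}{1 - v^2} \quad \text{on } (-1, 1)$$ (4.9.30)

is given by $h(v) = \beta\log(1 - v^2)$. Therefore by Proposition 4.4 a solution $\rho$ of
$(\rho\sigma)' = \rho\tau$ on $(-1, 1)$ is given by

$$\rho(v) = e^{h(v)} = (1 - v^2)^\beta.$$ (4.9.31)

Finally, for $n = 0, 1, 2, 3, \ldots$ let

$$\lambda_n \overset{\text{def}}{=} -n\tau' - \frac{n(n-1)}{2}\sigma'', \qquad y_n \overset{\text{def}}{=} \frac{1}{\rho}\frac{d^n}{dv^n}(\sigma^n\rho).$$ (4.9.32)

That is, by (4.9.18), (4.9.27), (4.9.31), and (4.2.14)

$$\lambda_n = 2n(1 + \beta) + n(n - 1), \qquad y_n(v) = (1 - v^2)^{-\beta}\frac{d^n}{dv^n}(1 - v^2)^{n+\beta} = 2^n\,n!\,(-1)^nP_n^{(\beta,\beta)}(v)$$ (4.9.33)

where $P_n^{(\beta,\beta)}$ is the Jacobi polynomial of degree $n$. Then by Theorem 4.2 we know
that $y_n$ is a solution of (4.9.26) provided that $\lambda$ satisfies the integrality condition $\lambda = \lambda_n$, which provides for the quantization of energy. Namely, by (4.9.27),
(4.9.33), $\lambda = \lambda_n$ means that $\gamma^2 - \beta^2 - \beta = 2n(1 + \beta) + n(n - 1)$, which is a quadratic
equation in $\beta$ with solutions

$$\beta^{\pm} = \frac{-(2n+1) \pm \sqrt{1 + 4\gamma^2}}{2}.$$ (4.9.34)

As $\beta > 0$ we choose $\beta = \beta^+$, which we denote more simply by $\beta_n$. That is, by
definition (4.9.20) the condition $\beta = \beta_n$ gives

$$E = E_n \overset{\text{def}}{=} -\frac{a^2\hbar^2\beta_n^2}{2m}.$$ (4.9.35)

Note here that the condition $\beta_n > 0$ forces $n$ to satisfy

$$0 \le n < \frac{\sqrt{1 + 4\gamma^2}}{2} - \frac{1}{2},$$ (4.9.36)

so that there are only finitely many eigenvalues (= negative energy levels) prescribed in (4.9.35). The corresponding wave functions $\psi_n$ (= solutions of (4.9.10))
are given as follows. Going back to (4.9.13), (4.9.29) we take $\psi_n(f(v)) = \Phi_n(v) \overset{\text{def}}{=} \phi_n(v)y_n(v)$, where $\phi_n(v) \overset{\text{def}}{=} (1 - v^2)^{\beta_n/2}$; see (4.9.28). That is, for $x = f(v)$ we
have $v = \tanh ax$ (by (4.9.12)) $\Rightarrow$

$$\psi_n(x) = (1 - v^2)^{\beta_n/2}\,y_n(v) = 2^n\,n!\,(-1)^n(\operatorname{sech} ax)^{\beta_n}P_n^{(\beta_n,\beta_n)}(\tanh ax) \quad \text{(by (4.9.33))}$$ (4.9.37)

for

$$\beta_n \overset{\text{def}}{=} \frac{\sqrt{1 + 4\gamma^2} - (2n+1)}{2} > 0, \qquad \gamma^2 = \frac{2mV_0}{a^2\hbar^2}, \qquad E = E_n = -\frac{a^2\hbar^2\beta_n^2}{2m},$$ (4.9.38)

where $\beta_n > 0$ $\Rightarrow$ $n$ is subject to the inequality in (4.9.36).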
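The quantization rule can be made concrete with a few lines of code. The sketch below lists the admissible $\beta_n$ from (4.9.34) and (4.9.36), converts them to energies via (4.9.35), and re-checks the condition $\lambda = \lambda_n$. The default units `mass = hbar = 1` and the function names are assumptions of the illustration, not part of the text.

```python
import math

def bound_state_betas(gamma_sq):
    """Positive roots beta_n = (sqrt(1 + 4 gamma^2) - (2n + 1)) / 2, n = 0, 1, 2, ..."""
    root = math.sqrt(1.0 + 4.0 * gamma_sq)
    betas = []
    n = 0
    while (root - (2 * n + 1)) / 2 > 0:          # inequality (4.9.36)
        betas.append((root - (2 * n + 1)) / 2)
        n += 1
    return betas

def energies(v0, a, mass=1.0, hbar=1.0):
    """Bound-state energies E_n = -(a hbar beta_n)^2 / (2 m) for V(x) = -v0 / cosh^2(a x)."""
    gamma_sq = 2.0 * mass * v0 / (a * a * hbar * hbar)
    return [-(a * hbar * beta) ** 2 / (2.0 * mass)
            for beta in bound_state_betas(gamma_sq)]

def quantization_residual(gamma_sq, n, beta):
    """lambda - lambda_n from (4.9.27) and (4.9.33); it should vanish for beta = beta_n."""
    return (gamma_sq - beta ** 2 - beta) - (2 * n * (1 + beta) + n * (n - 1))

# gamma^2 = s(s + 1) with s = 2 gives sqrt(1 + 4 gamma^2) = 5 and beta_n = 2 - n.
assert bound_state_betas(6.0) == [2.0, 1.0]
assert energies(v0=3.0, a=1.0) == [-2.0, -0.5]
for n, beta in enumerate(bound_state_betas(11.3)):
    assert abs(quantization_residual(11.3, n, beta)) < 1e-12
```

When $\gamma^2 = s(s+1)$ for a positive integer $s$, the would-be level with $\beta_n = 0$ is excluded by the strict inequality in (4.9.36), which is why the first example returns two values rather than three.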



Appendix 4A Spherical Harmonics 

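For integers $l$, $m$ with $l \ge 0$, $-l \le m \le l$ define $Z_l^m$ on $\mathbb{R} \times (0, \pi)$ by

$$Z_l^m(\theta, \phi) \overset{\text{def}}{=} e^{im\theta}P_l^{|m|}(\cos\phi),$$ (4.A.1)

where $P_l^{|m|}$ is the associated Legendre function in (4.2.15). Let $g : \mathbb{R}^3 \to \mathbb{R}^3$ be
the spherical coordinates map:

$$g(\rho, \theta, \phi) \overset{\text{def}}{=} (\rho\cos\theta\sin\phi,\ \rho\sin\theta\sin\phi,\ \rho\cos\phi).$$ (4.A.2)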




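Figure 4.1: Spherical coordinates. A point $P = (x, y, z)$ has spherical coordinates $(\rho, \theta, \phi)$ with $x = \rho\sin\phi\cos\theta$, $y = \rho\sin\phi\sin\theta$, $z = \rho\cos\phi$; $0 \le \theta \le 2\pi$, $0 \le \phi \le \pi$; $x^2 + y^2 + z^2 = \rho^2$.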



We shall need the expression of the Laplacian in spherical coordinates: 



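Theorem 4.12. Let $W = \{(x, y, z) \in \mathbb{R}^3 \mid (x, y) = (0, 0)\}$, and let $T = \{a = (\rho, \theta, \phi) \in \mathbb{R}^3 \mid \text{both } \rho, \sin\phi \ne 0\}$, which is an open set in $\mathbb{R}^3$ such that $g(a) \in \mathbb{R}^3 - W \iff a \in T$ for $g$ in (4.A.2). In particular $g : T \to \mathbb{R}^3 - W$, so that
given a function $f : \mathbb{R}^3 - W \to \mathbb{R}$ which has continuous first and second partial
derivatives on the open set $\mathbb{R}^3 - W \subset \mathbb{R}^3 - \{0\}$ we can define $F : T \to \mathbb{R}$ by
$F(a) = f(g(a))$, $a \in T$. Then for $a \in T$ one has the following equation:

$$\left(\frac{\partial^2f}{\partial x^2} + \frac{\partial^2f}{\partial y^2} + \frac{\partial^2f}{\partial z^2}\right)(g(a)) = \frac{1}{\rho^2}\left[\frac{\partial}{\partial\rho}\left(\rho^2\frac{\partial F}{\partial\rho}\right)(a) + \frac{1}{\sin\phi}\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial F}{\partial\phi}\right)(a) + \frac{1}{\sin^2\phi}\frac{\partial^2F}{\partial\theta^2}(a)\right].$$ (4.A.3)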



The proof, as is well known, involves only the chain rule. 
Note that for 



5+ = {^3! = (p,9,<p) G R^Ip > 0,a G R,0 < 0 < ;t}. 



(4.A.4) 







F*- {(x,y,z)6E'|(x,y)7^(0,0)} 



one has 

Proposition 4.13. g : ^ V* is one-to-one and onto. 

In this appendix we prove 

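Theorem 4.13. There exists a unique function $Y_l^m : \mathbb{R}^3 \to \mathbb{C}$ such that

$$Y_l^m(g(\rho, \theta, \phi)) = \rho^lZ_l^m(\theta, \phi) = \rho^le^{im\theta}P_l^{|m|}(\cos\phi)$$ (4.A.5)

on $\mathbb{R} \times \mathbb{R} \times (0, \pi)$. In fact $Y_l^m$ is a homogeneous polynomial of degree $l$ (i.e.,
$Y_l^m(tx, ty, tz) = t^lY_l^m(x, y, z)$ for $(x, y, z) \in \mathbb{R}^3$, $t \in \mathbb{R}$) and is harmonic: $\nabla^2Y_l^m = 0$.

The functions $\{Y_l^m\}_{l,m \in \mathbb{Z},\ l \ge 0,\ -l \le m \le l}$ are called spherical harmonics of degree $l$
and order $m$. The uniqueness assertion of Theorem 4.13 is clear since $g$ is surjective.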

From (4.2.11) one has 



for m < I, where (x^ - 1)^ = ^^[=0 If I + m > 21 -2k (i.e., 

2k > I - m) then = 0. Thus 



p("») 





dl+m 

— ; — ^ 
dx^+^ 



2k<l—m 



= ^ Z (l)(-l)H2l-2k){2l-2k-\) 

' 0<k<l ^ ^ 

l-m-2k>0 

• {21 - 2k - 2) ■ • ■ {21 - 2k - (I + m - i))x2'-2*=-('+'") 

Z ([)(-D'‘(2/-2A:)(2/-2A:-1) 

‘ 0<k<l ^ ^ 

l-m-2k>0 

■{2l-2k-2)---{l-m-2k+ 



(4.A.7) 



If K is the largest integer k G [0, /] such that / — m — > 0 we can therefore write 

P^ix) = Ef=o dkx'-"'-^’^ for suitable 4 = 4,,,„ eR;0<m</;K = 
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For m {0}, 9 eR we can write 






|m| 



COS 






m 



COS 9 + i - — - sin ^ 
\m\ 



\m\ 






= 2 ej sin^‘ 9 cosl'”'-^’ 9, Cj = ej,m e C. 
7=0 



7=0 

\m\ 



Now for / > 0, -/ < m < /, /, w G Z, 
4>) e‘”'^pf'\co& (p) 



on R X (0, 7t) with 



\m\ 



^im 0 _ ^ Q j Q for m 0, 

7=0 

K 

(since |/n| < /), 



k=0 



(4.A.8) 



(4.A.9) 



(4. A. 10) 
(4.A.11) 



as just seen. ForO < ^ < ;r, cos^ 6 (-1, 1) (cos^) (1 — 

p/'^'^cos^) = {sm^4>y"‘y^P^'”''\cos4>) with sin^ > 0 : (sin^ = 

(sin^)l'"l; i.e., 

p'^'Ccos p) = (sin ^jl^lp/'^'^cos p). (4.A.12) 

def 

For m = 0, equation (4. A. 10) holds with the right-hand side interpreted as = 1. 
That is, 

Z^{e,<p) = e‘"'^pj”'\cosp) 

= e"”^sin^)l'"lp/''"'^(cos^) 

K |m| 

k=0 j=0 
K \m\ 

= ^ ^ ejdkicos 9 sin 0)’'”’~^(sin 9 sin 0)^(cos 

k=0 7=0 



(4.A.13) 
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Therefore if we define Y!^ : C by 



Y^ix, 3^, Z) = 2 2 ejdkix^ + / + 

k=0 j=Q 

we obtain for 0 < ^ < ;r, 



(4. A. 14) 



Yj^ip sin 4> cos 9, p sin 4> sin 9, p cos 4>) 

K \m\ 

= ^ ^ejdkp^^ {sirup COS p^ (sin sin 9) 



;^/-|m|-2/c(cos^)/-NI-2fc 



k=0 7=0 
K \m\ 



= ^^ejdkp^(sinpcos9)^^^ ^ {sinpsin9y{cosp) 



l-\m\-2k 



k=0 7=0 

p^zy{9,p) by (4.A.13) 



(4.A.15) 



as desired. By definition of 7^"* one has Yl^{tx, ty, tz) = t^Y^^ix, y, z) for (x, y, z) € 
R^, t G M. To conclude the proof we need to show that TJ'” is harmonic: ¥^7^"* = 0. 
The differential equation in (4.9.9) of Corollary 4.7 



(1 - x^)y"(x) - 2xy\x) + /(/ + 1) - 



y(x) = 0 (4.A.16) 



on (-1, 1), satisfied by the associated Legendre functions /, m = 0, 1, 2,3, • - . , 
m < I can be expressed in an alternate manner: By the change of variables x = 
cos ^ for 0 G (0, 7t) one has 



sin p dp 



(sinp)—y{cosp) = (siryp)y" (cos p) - 2(cos p)y' {cos p) 
dp 



i\-x^)y"{x)-2xy'{x), 



(4.A.17) 



SO that (4.A.16) may be expressed as 



sin p dp 



— \{sinP)—y{cosP)\ + /(/ + 1) - y(cos0) = 0 (4.A.18) 



on (0,;r), for /, m = 0,1,2, 3, As Z’P{9,p)^= e^'”^pj^*(cos^) one obtains 



immediately from (4.A.18) that Zj” satisfies 



sin p dp 



(sin^)— — (6>,0) + 



1 d^Zy 
n^p d9^ 



-{9,P) = -l{l + l)Z^{9,P) 



(4.A.19) 



on R X (0, Tt)’, /, m G Z, / > 0, |m| < /. 
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By Theorem 4.12 for u = (r, 0, e with r 0, sin ^ 5^ 0 






1 



dr 



,dF 



t: US- («) + 



dr 



I ^ 

sin <p 34> 






(a) + 



1 d^F , 
sin^^ dO^ 



(«)J 

(4.A.20) 



for 



F(a) = y"(g(a)) = r'Z“(0, </.). (4.A.21) 



Now 

Tr ('■'^) ^)(= /(/ + DUa)), (4.A.22) 

which by (4.A.19), (4.A.20) gives (V^y”)(g(a)) = 0 for a = (r,9,4>), r 
0, (6,^) € R X (0, ;r). By Proposition 4.13 we must have (V^y”)(x,y, z) = 
0 for (x,y,z) € R^ such that (x,y) (0,0). By continuity (V^y”)(x, y, z) 

= 0 V(x,y,z) € R^: For example lim„^oo “ (0>0,z) =4> 0 = 

lim„^„ (vS'") l,z) = (V^y™) (0,0, z). □ 

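By (4.A.2), $g = (g_1, g_2, g_3)$ where

$$g_1(a) = \rho\cos\theta\sin\phi, \quad g_2(a) = \rho\sin\theta\sin\phi, \quad g_3(a) = \rho\cos\phi$$ (4.A.23)

for $a = (\rho, \theta, \phi) \in \mathbb{R}^3$. Let $F = Y_l^m \circ g : \mathbb{R}^3 \to \mathbb{C}$. By the chain rule

$$\frac{\partial F}{\partial\theta}(a) = \frac{\partial Y_l^m}{\partial x}(g(a))\frac{\partial g_1}{\partial\theta}(a) + \frac{\partial Y_l^m}{\partial y}(g(a))\frac{\partial g_2}{\partial\theta}(a) + \frac{\partial Y_l^m}{\partial z}(g(a))\frac{\partial g_3}{\partial\theta}(a) = -\frac{\partial Y_l^m}{\partial x}(g(a))\,g_2(a) + \frac{\partial Y_l^m}{\partial y}(g(a))\,g_1(a).$$ (4.A.24)

On the other hand, by definition (4.A.5), $F(a) = \rho^le^{im\theta}P_l^{|m|}(\cos\phi)$ $\Rightarrow$

$$\frac{\partial F}{\partial\theta}(a) = \rho^l\,im\,e^{im\theta}P_l^{|m|}(\cos\phi) = imF(a).$$ (4.A.25)

Now $g$ maps $\mathbb{R}^3$ onto $\mathbb{R}^3$. Thus for $(x, y, z) \in \mathbb{R}^3$ arbitrary, write $(x, y, z) = g(a)$
for some suitable $a = (\rho, \theta, \phi) \in \mathbb{R}^3$. By (4.A.25) and (4.A.24),

$$imY_l^m(x, y, z) = imF(a) = -\frac{\partial Y_l^m}{\partial x}(x, y, z)\,y + \frac{\partial Y_l^m}{\partial y}(x, y, z)\,x,$$ (4.A.26)

which proves

Proposition 4.14. For the differential operator $D \overset{\text{def}}{=} x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}$, $DY_l^m = imY_l^m$.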







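Using definition (4.7.45) we can extend $Z_l^m$ to a continuous function on $\mathbb{R} \times [0, \pi]$. Namely

$$Z_l^m(\theta, \phi = 0 \text{ or } \pi) \overset{\text{def}}{=} \begin{cases} 0 & \text{for } m \ne 0, \\ P_l(1) \text{ or } P_l(-1) & \text{for } m = 0. \end{cases}$$ (4.A.27)

Since for $m, r \in \mathbb{Z}$

$$\int_0^{2\pi}\exp(im\theta)\exp(-ir\theta)\,d\theta = 2\pi\delta_{mr}$$ (4.A.28)

Theorem 4.8 leads to the following.

Theorem 4.14. Let $l, n, m, r \in \mathbb{Z}$ with $l, n \ge 0$, $-l \le m \le l$, $-n \le r \le n$. Then

$$\frac{1}{4\pi}\int_0^{2\pi}\int_0^{\pi}Z_l^m(\theta, \phi)\,\overline{Z_n^r(\theta, \phi)}\,\sin\phi\,d\phi\,d\theta = \frac{1}{2l+1}\,\frac{(l + |m|)!}{(l - |m|)!}\,\delta_{(l,m),(n,r)}.$$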

4^ Jo Jo 2/ + l(/-|m|)! 



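Appendix 4B The Polynomials $L_n$

For $n \ge 0$ an integer define

$$L_n \overset{\text{def}}{=} n!\,L_n^0$$ (4.B.1)

where $L_n^0$ is the Laguerre polynomial of degree $n$ and order 0. By (4.2.12)

$$L_n(x) = e^x\frac{d^n}{dx^n}\big(e^{-x}x^n\big) \quad \text{for } x \in \mathbb{R}.$$ (4.B.2)

Proposition 4.15. The $j$th derivative $L_n^{(j)}$ of $L_n$ is given by

$$L_n^{(j)} = (-1)^jn!\,L_{n-j}^{j} \quad \text{for } 0 \le j \le n,$$ (4.B.3)

$$L_n^{(j)} = 0 \quad \text{for } j > n.$$

Proof. (4.B.3) is true for $j = 0$ by definition (4.B.1). By Proposition 4.7

$$(L_n^\alpha)^{(j)}(x) = (-1)^jL_{n-j}^{\alpha+j}(x)$$ (4.B.4)

on $(0, \infty)$ for $n \ge j$. There we took $x > 0$ to define $\rho(x) = x^\alpha e^{-x}$, $x^\alpha \overset{\text{def}}{=} e^{\alpha\log x}$,
$\alpha \in \mathbb{R}$. In the present situation however $\alpha = 0 \in \mathbb{Z}$, and we may assert that (4.B.4)
is valid for all $x \in \mathbb{R}$. Similarly, by Corollary 4.4, for $\alpha \in \mathbb{Z}$

$$x(L_n^\alpha)''(x) + (\alpha + 1 - x)(L_n^\alpha)'(x) + nL_n^\alpha(x) = 0$$ (4.B.5)

for all $x \in \mathbb{R}$. By (4.B.4) therefore, $L_n^{(j)} = n!\,(L_n^0)^{(j)} = n!\,(-1)^jL_{n-j}^{j}$ on $\mathbb{R}$ for $n \ge j$, as desired. Of course $L_n^{(j)} = 0$ for $j > n$, as $L_n$ is a polynomial of degree $n$. □

By (4.B.4) and (4.B.5) we see moreover the following.

Corollary 4.8. For integers $j$, $n$

$$x\big(L_n^{(j)}\big)''(x) + (j + 1 - x)\big(L_n^{(j)}\big)'(x) + (n - j)L_n^{(j)}(x) = 0$$

on $\mathbb{R}$ for $0 \le j \le n$, and hence for all $j, n \ge 0$.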


u ivi m ^ u 

P/(l)orP,(-l) for/n = 0 



(4.A.27) 




5 

Hydrogen-like Atoms 



Because of considerable mathematical difficulties the Schrödinger equation can
be solved exactly for only a few simple quantum systems. In particular for atoms
with at least two electrons no such exact solutions are known. The most important system where an exact solution is indeed possible consists of a single electron with
charge $q_1 = -e$ in motion under the influence of a Coulomb potential about a
nucleus with a positive charge $q_2 = Ze$, for $Z \ge 1$ an integer. Such a system is
called a hydrogenic or hydrogen-like atom, with atomic number $Z$. One may view
the nucleus as consisting of $Z$ protons. For $Z = 1$ we obtain the hydrogen atom.
For $Z = 2$, for example, the system is singly ionized helium He$^+$; this is helium
with its two protons as usual, but stripped of one of its two electrons. Since it has
only one electron but two protons it is positively charged, as indicated by the plus
sign +. Similarly the lithium atom Li consists of three electrons and three protons
(and four neutrons). If we strip off two electrons we obtain the ion Li$^{++}$, doubly
ionized lithium, which is the case $Z = 3$.

The wave functions of hydrogenic atoms can be used to provide a fairly good
qualitative description of properties of multi-electron atoms and molecules, the
starting point or model for the study of all other atoms being the hydrogen atom.
Hence, the importance of this chapter is clear. If our system had $n \ge 2$ electrons instead, then we could study it by assuming for simplicity that each electron
moves independently of the others and is influenced only by the average field due
to these other electrons, and of course by the field due to the nucleus. A discussion
of multi-electron atoms will be taken up in Chapter 10.

In addition to obtaining explicit wave functions for the hydrogenic atom we 
will obtain the other valuable piece of information concerning the specification 
of possible energy states the electron can occupy. As in the old theory of Bohr 
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we will discover that the (negative) electron energies are not arbitrary, but are 
quantized: only certain values are possible, as we have seen in
previous examples. These wave functions will carry three bits of quantum data
$(n, l, m)$: the total quantum number $n = 1, 2, 3, \ldots$, the azimuthal (or orbital)
quantum number $l = 0, 1, 2, \ldots, n - 1$, and the magnetic quantum number $m =
-l, -l + 1, \ldots, -1, 0, 1, 2, \ldots, l$. In accord with earlier notation the pair $(l, m) \in
\mathbb{Z} \times \mathbb{Z}$, with $l \geq |m|$, indeed will arise as the label for the associated Legendre
function $P_l^{|m|}$. The energy levels will depend only on the total quantum number
$n$. Thus we can obtain a rather comprehensive picture of the hydrogenic atom.
The above data, however, is not sufficient to account for experimentally observed 
phenomena such as fine-structure splitting of spectral lines and Zeeman effects. 
For this one must consider, in addition, the electron-spin quantum number $s = \frac{1}{2}$,
a point which we will deal with later. Here, as shown by Uhlenbeck and Goudsmit, 
one assigns to the electron an “angular momentum and magnetic moment,” as a 
charged object spinning about an axis would have. The electron however does not 
literally spin about an axis. 



5.1 Solutions of Schrödinger's Equation for
Hydrogenic Atoms



Let $(x_1, y_1, z_1)$, $(x_2, y_2, z_2)$ denote the positions of the nucleus and the
electron, and $m_1$, $m_2$ their masses respectively. Let $U \subset \mathbb{R}^6$ be the open subset
$\{(p_1, p_2) \mid p_1 \neq p_2\}$ considered in Section 2.5 of Chapter 2, where we now write
$p_j = (x_j, y_j, z_j)$ for $j = 1, 2$. Define continuous functions $F^{(1)}, F^{(2)} : U \to \mathbb{R}^3$ by

$$F^{(1)}(p_1, p_2) = \frac{q_1 q_2 (p_1 - p_2)}{|p_1 - p_2|^3} = \frac{-Ze^2 (p_1 - p_2)}{|p_1 - p_2|^3}, \qquad F^{(2)} = -F^{(1)}. \tag{5.1.1}$$

Then, as discussed in Chapter 2, the electrostatic Coulomb force
exerted on the electron by the nucleus is $F^{(2)}(p_1, p_2)$:

[Figure 5.1: Electrostatic Coulomb force on an electron. The nucleus, with charge $Ze$, is at position $p_1 = (x_1, y_1, z_1)$.]

The arrow in Figure 5.1 indicates the direction of $F^{(2)}(p_1, p_2)$, and $r$ is the
distance from $p_1$ to $p_2$. Of course $F^{(1)}(p_1, p_2)$ (the force exerted on the nucleus by
the electron) and $F^{(2)}(p_1, p_2)$ have the same magnitude:

$$|F^{(j)}(p_1, p_2)| = \frac{Ze^2}{r^2}, \qquad j = 1, 2, \tag{5.1.2}$$



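by equation (5.1.1). If $F_x^{(j)}, F_y^{(j)}, F_z^{(j)} : U \to \mathbb{R}$ denote the components of $F^{(j)}$, then

$$F_x^{(j)} = -\frac{\partial V}{\partial x_j}, \qquad F_y^{(j)} = -\frac{\partial V}{\partial y_j}, \qquad F_z^{(j)} = -\frac{\partial V}{\partial z_j} \tag{5.1.3}$$

for the potential energy $V : U \to \mathbb{R}$ of our system given by equation (2.5.4) for
$q_2 = -e$, $q_1 = Ze$:

$$V(p_1, p_2) = \frac{-Ze^2}{r}, \qquad r = \left[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2\right]^{1/2}. \tag{5.1.4}$$

By equation (3.4.10), the time-independent Schrödinger equation (TISE) is

$$\frac{1}{m_1}\nabla_1^2\psi + \frac{1}{m_2}\nabla_2^2\psi + \frac{8\pi^2}{h^2}(E - V)\psi = 0 \tag{5.1.5}$$

for

$$\nabla_j^2 = \frac{\partial^2}{\partial x_j^2} + \frac{\partial^2}{\partial y_j^2} + \frac{\partial^2}{\partial z_j^2}. \tag{5.1.6}$$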



To study equation (5.1.5) we pass from Cartesian coordinates to center of mass
coordinates, via the diffeomorphism $\Phi$ of $\mathbb{R}^6$ such that $\Phi(U) = (\mathbb{R}^3 - \{0\}) \times \mathbb{R}^3$,
given in Chapters 2, 3; see Section 3.8 of Chapter 3. We look for a continuous
function $\psi : \mathbb{R}^6 \to \mathbb{C}$ which solves (5.1.5) such that $\psi|_U$ is class $C^2$ on $U$: $\psi|_U$
has continuous first and second partial derivatives on $U$. Consider $f = \psi \circ \Phi^{-1} :
\mathbb{R}^6 \to \mathbb{C}$; $f$ is continuous and $f|_{\Phi(U)}$ is class $C^2$. If $\mu = \frac{m_1 m_2}{m_1 + m_2}$ is the reduced mass
of the system, then $\Phi$ transforms equation (5.1.5) on $U$ to the equation

$$\frac{1}{\mu}\nabla_1^2 f + \frac{1}{m_1 + m_2}\nabla_2^2 f + \frac{8\pi^2}{h^2}\left(E - V \circ \Phi^{-1}\right) f = 0 \tag{5.1.7}$$

on $\Phi(U)$, as we have seen, where

$$(V \circ \Phi^{-1})(p_1, p_2) = \frac{-Ze^2}{|p_1|} \tag{5.1.8}$$

on $\Phi(U)$; see equations (3.8.8), (3.8.9). Now apply Theorem 3.3 and the converse
argument that followed its statement: Under a separation of variables $f = f_1 \otimes f_2$,
$f$ satisfies (5.1.7) if and only if $f_1$, $f_2$ satisfy

$$\begin{aligned}
(\nabla^2 f_1)(x^{(1)}) + \frac{8\pi^2\mu}{h^2}\left[E_1 + \frac{Ze^2}{\sqrt{x_1^2 + y_1^2 + z_1^2}}\right] f_1(x^{(1)}) &= 0, \\
(\nabla^2 f_2)(x^{(2)}) + \frac{8\pi^2 (m_1 + m_2)}{h^2} E_2\, f_2(x^{(2)}) &= 0
\end{aligned} \tag{5.1.9}$$

for $x^{(1)} = (x_1, y_1, z_1) \in \mathbb{R}^3 - \{0\}$, $x^{(2)} = (x_2, y_2, z_2) \in \mathbb{R}^3$. Here $E_1 + E_2 = E$,
$f_1, f_2 : \mathbb{R}^3 \to \mathbb{C}$ are continuous, and $f_1|_{\mathbb{R}^3 - \{0\}}$, $f_2$ are of class $C^2$. Of course
we have interest only in the case $f \neq 0$, in which case both $f_1, f_2 \neq 0$. Note
that the second equation (5.1.9) is the TISE for a free particle of mass $m_1 +
m_2$; see Section 3.6 of Chapter 3. The energy $E_2$ here (called the translational
energy of the system) is non-quantized, as we know, and generally plays a less
important role in quantum mechanics. We will therefore maintain attention on the
first equation in (5.1.9) in the discussion which follows. This equation is the TISE
on $\mathbb{R}^3 - \{0\}$ of a single particle of mass $\mu$ influenced by the Coulomb potential

$$(x, y, z) \mapsto \frac{-Ze^2}{\sqrt{x^2 + y^2 + z^2}}. \tag{5.1.10}$$



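Again we search for bound states of the electron, i.e., we take $E_1 < 0$.
To make further progress we now pass to spherical coordinates. Let

$$W = \{(x, y, z) \in \mathbb{R}^3 \mid (x, y) = (0, 0)\} \tag{5.1.11}$$

and let

$$T = \{a = (r, \theta, \phi) \in \mathbb{R}^3 \mid \text{both } r, \sin\phi \neq 0\} = g^{-1}(\mathbb{R}^3 - W) \tag{5.1.12}$$

where $g : \mathbb{R}^3 \to \mathbb{R}^3$ denotes the spherical coordinate map:

$$g(r, \theta, \phi) \stackrel{\text{def}}{=} (r\cos\theta\sin\phi,\; r\sin\theta\sin\phi,\; r\cos\phi).$$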



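Given Theorem 4.12 in Appendix 4A we can express the first equation in
(5.1.9) as

$$\frac{1}{r^2}\left[\frac{\partial}{\partial r}\left(r^2\frac{\partial F}{\partial r}\right) + \frac{1}{\sin\phi}\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial F}{\partial\phi}\right) + \frac{1}{\sin^2\phi}\frac{\partial^2 F}{\partial\theta^2}\right](a) + \frac{8\pi^2\mu}{h^2}\left(E_1 + \frac{Ze^2}{|r|}\right)F(a) = 0 \tag{5.1.13}$$

for $F(a) = f_1(g(a))$, where we write $a = (r, \theta, \phi) \in \mathbb{R}^3$, $x^{(1)} = g(a)$. Here we
need $x^{(1)} \in \mathbb{R}^3 - W$ (i.e., $(x_1, y_1) \neq (0, 0)$) or equivalently, by (5.1.12), $a \in T$:
both $r$ and $\sin\phi \neq 0$. Note that for

$$S^+ = \{a \in \mathbb{R}^3 \mid r > 0,\ 0 < \phi < \pi,\ \theta \in \mathbb{R}\}, \tag{5.1.14}$$

$$V^* = \{(x, y, z) \in \mathbb{R}^3 \mid (x, y) \neq (0, 0)\} = \mathbb{R}^3 - W \tag{5.1.15}$$

one has (cf. Appendix 4A, Proposition 4.13) the following.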
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Proposition 5.1. g maps onto V*. In particular (5.1.13) holds on for f\ 
satisfying (5.1.9). g is one-to-one on S'^. 

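For equation (5.1.13) consider again a separation of variables, as follows. Let
$F_1 : \mathbb{R}^+ = \{r \in \mathbb{R} \mid r > 0\} \to \mathbb{C}$, $G : \mathbb{R} \times (0, \pi) \to \mathbb{C}$ be of class $C^2$ such that

$$\begin{aligned}
\frac{1}{r^2}\frac{d}{dr}\left(r^2 F_1'\right)(r) + \frac{\Lambda}{r^2}F_1(r) + C\left(E_1 - V_0(r)\right)F_1(r) &= 0, \\
\frac{1}{\sin\phi}\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial G}{\partial\phi}\right)(\theta, \phi) + \frac{1}{\sin^2\phi}\frac{\partial^2 G}{\partial\theta^2}(\theta, \phi) &= \Lambda G(\theta, \phi)
\end{aligned} \tag{5.1.16}$$

on $\mathbb{R}^+$, $\mathbb{R} \times (0, \pi)$ respectively where $\Lambda \in \mathbb{R}$, $C = \frac{8\pi^2\mu}{h^2}$, and $V_0$ is defined on $\mathbb{R}^+$
by $V_0(r) = \frac{c}{r}$ for $c = q_1 q_2 = -Ze^2$.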

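Put $F = F_1 \otimes G$ on $\mathbb{R}^+ \times \mathbb{R} \times (0, \pi) = S^+$: $F(r, \theta, \phi) = F_1(r)G(\theta, \phi)$. Then
$F$ is of class $C^2$ and satisfies equation (5.1.13) on $S^+$. Conversely suppose $F =
F_1 \otimes G \neq 0$ satisfies equation (5.1.13) on $S^+$:

$$0 = \frac{1}{r^2}\frac{d}{dr}\left(r^2 F_1'\right)(r)\,G(\theta, \phi) + \frac{F_1(r)}{r^2\sin\phi}\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial G}{\partial\phi}\right)(\theta, \phi) + \frac{F_1(r)}{r^2\sin^2\phi}\frac{\partial^2 G}{\partial\theta^2}(\theta, \phi) + C\left(E_1 - V_0(r)\right)F_1(r)G(\theta, \phi). \tag{5.1.17}$$

Choose $(\theta_0, \phi_0) \in \mathbb{R} \times (0, \pi)$ such that $G(\theta_0, \phi_0) \neq 0$ and divide (5.1.17) by
$G(\theta_0, \phi_0)$. One gets the first equation in (5.1.16) on $\mathbb{R}^+$ for

$$\Lambda = \frac{\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial G}{\partial\phi}\right)(\theta_0, \phi_0)}{(\sin\phi_0)\,G(\theta_0, \phi_0)} + \frac{\frac{\partial^2 G}{\partial\theta^2}(\theta_0, \phi_0)}{(\sin^2\phi_0)\,G(\theta_0, \phi_0)}. \tag{5.1.18}$$

Similarly choose $r_0 \in \mathbb{R}^+$ such that $F_1(r_0) \neq 0$ and divide (5.1.17) by $F_1(r_0)$.
Using that

$$\frac{\frac{d}{dr}\left(r^2 F_1'\right)(r_0)}{r_0^2 F_1(r_0)} + C\left(E_1 - V_0(r_0)\right) = -\frac{\Lambda}{r_0^2}, \tag{5.1.19}$$

which is the first equation in (5.1.16), one gets

$$0 = -\frac{\Lambda}{r_0^2}G(\theta, \phi) + \frac{1}{r_0^2\sin\phi}\frac{\partial}{\partial\phi}\left(\sin\phi\,\frac{\partial G}{\partial\phi}\right)(\theta, \phi) + \frac{1}{r_0^2\sin^2\phi}\frac{\partial^2 G}{\partial\theta^2}(\theta, \phi) \tag{5.1.20}$$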



on $\mathbb{R} \times (0, \pi)$, which is the second equation in (5.1.16). Therefore consider the first
and second equations in (5.1.16). The first equation is exactly equation (4.5.9)
which we solved there, where $E_1$ there was assumed to be negative. Here we
also assume $E_1 < 0$. Physically this means that the total energy of our system
is insufficient to permit ionization. Let $l, n \geq 0$ be integers with $n \geq l + 1$. For
$\Lambda = -l(l + 1)$ we obtained the solutions $u_{nl}$ given by (4.5.22):

$$u_{nl}(r) = r^l \exp\left(-\frac{\alpha_n r}{2}\right)[n - (l + 1)]!\; L^{(2l+1)}_{n-(l+1)}(\rho_n(r)) \tag{5.1.21}$$

on $\mathbb{R}^+$ for

$$E_1 = E_1^{(n)} \stackrel{\text{def}}{=} -\frac{2\pi^2\mu Z^2 e^4}{h^2 n^2} = -\frac{\mu Z^2 e^4}{2\hbar^2 n^2} \tag{5.1.22}$$

where

$$\alpha_n = \frac{2\mu Z e^2}{\hbar^2 n}, \qquad \rho_n(r) = \alpha_n r, \quad r \in \mathbb{R}^+. \tag{5.1.23}$$

$L^{(2l+1)}_{n-(l+1)}$ is the Laguerre polynomial of degree $n - (l + 1)$ and order $2l + 1$ defined
in (4.2.12).
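As a numerical illustration of (5.1.22) and (5.1.23), the following minimal sketch evaluates $E_1^{(n)}$ and $\alpha_n$ in Gaussian (CGS) units. The constant values and the function names (`energy_level`, `alpha`, `reduced_mass`) are assumptions of this sketch, not part of the text; the default reduced mass is that of hydrogen (proton plus electron), so for other hydrogenic atoms the nuclear mass should be supplied.

```python
# Physical constants in Gaussian (CGS) units; values are assumptions of this sketch.
E_CHARGE = 4.80320471e-10    # electron charge e (esu)
HBAR = 1.054571817e-27       # reduced Planck constant (erg s)
M_ELECTRON = 9.1093837e-28   # electron mass (g)
M_PROTON = 1.67262192e-24    # proton mass (g)
ERG_PER_EV = 1.602176634e-12 # conversion factor erg per eV


def reduced_mass(m1, m2):
    """Reduced mass mu = m1 m2 / (m1 + m2)."""
    return m1 * m2 / (m1 + m2)


MU_HYDROGEN = reduced_mass(M_PROTON, M_ELECTRON)


def energy_level(n, Z=1, mu=MU_HYDROGEN):
    """E_1^(n) = -mu Z^2 e^4 / (2 hbar^2 n^2), in erg, as in (5.1.22)."""
    return -mu * Z**2 * E_CHARGE**4 / (2.0 * HBAR**2 * n**2)


def alpha(n, Z=1, mu=MU_HYDROGEN):
    """alpha_n = 2 mu Z e^2 / (hbar^2 n), in 1/cm, as in (5.1.23)."""
    return 2.0 * mu * Z * E_CHARGE**2 / (HBAR**2 * n)


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, energy_level(n) / ERG_PER_EV, "eV")
```

The ground-state value comes out near $-13.6$ eV, and $2/\alpha_1$ reproduces the length $\hbar^2/(\mu e^2)$ for $Z = 1$.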

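Let $G_1$ be of class $C^2$ which satisfies

$$\frac{1}{\sin\phi}\frac{d}{d\phi}\left(\sin\phi\,\frac{dG_1}{d\phi}\right)(\phi) - \frac{m^2}{\sin^2\phi}G_1(\phi) - \Lambda G_1(\phi) = 0 \tag{5.1.24}$$

on $(0, \pi)$, where $m \in \mathbb{Z}$. Then we obtain a class $C^2$ solution $G$ of the second equation
in (5.1.16) by setting $G(\theta, \phi) \stackrel{\text{def}}{=} \frac{e^{im\theta}}{\sqrt{2\pi}}G_1(\phi)$ on $\mathbb{R} \times (0, \pi)$. To solve (5.1.24)
set $u(x) \stackrel{\text{def}}{=} G_1(\arccos x)$ on $(-1, 1)$. Then (5.1.24) is transformed to the following
differential equation of hypergeometric type:

$$u''(x) - \frac{2x}{1 - x^2}u'(x) + \frac{\left[-\Lambda(1 - x^2) - m^2\right]}{(1 - x^2)^2}u(x) = 0 \tag{5.1.25}$$

on $(-1, 1)$. The latter equation was reduced to canonical form (namely to Jacobi's
equation) and solved in Chapter 4; see Section 4.9. Given $-\Lambda = l(l + 1)$ again
where $l \geq 0$ is an integer we constructed the solutions $u = u_{lm}$ of (5.1.25) given
by (4.9.8):

$$u_{lm}(x) = \frac{2^{|m|}\, l!}{(l + |m|)!}P_l^{|m|}(x) \tag{5.1.26}$$

on $(-1, 1)$ for $l \geq |m|$, where $P_l^{|m|}$ is the associated Legendre function defined in
Section 4.2 of Chapter 4.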

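The task now is to piece together these various solutions to reach our original
goal of obtaining solutions $f_1$ of the first TISE in (5.1.9), under the assumption
that the internal energy $E_1$ is negative. For this we retrace our steps. Working
backwards we get solutions $G_1(\phi) \stackrel{\text{def}}{=} u(\cos\phi) = u_{lm}(\cos\phi)$ of (5.1.24) on $(0, \pi)$,
$G(\theta, \phi) = \frac{e^{im\theta}}{\sqrt{2\pi}}u_{lm}(\cos\phi)$ of the second equation in (5.1.16) on $\mathbb{R} \times
(0, \pi)$, for $\Lambda = -l(l + 1)$, and $F(r, \theta, \phi) = F_1(r)G(\theta, \phi) = u_{nl}(r)\frac{e^{im\theta}}{\sqrt{2\pi}}u_{lm}(\cos\phi)$
of (5.1.13) on $\mathbb{R}^+ \times \mathbb{R} \times (0, \pi) = S^+$ for $u_{nl}(r)$ given by (5.1.21), and for $E_1 = E_1^{(n)}$
in (5.1.22). But for $g : \mathbb{R}^3 \to \mathbb{R}^3$ defined following (5.1.12), $F(a) = f_1(g(a))$.
We finally obtain solutions (wave functions) $f_1 = \psi^0_{nlm}$ of

$$\nabla^2 f_1(x, y, z) + \frac{8\pi^2\mu}{h^2}\left(E_1 + \frac{Ze^2}{\sqrt{x^2 + y^2 + z^2}}\right)f_1(x, y, z) = 0 \tag{5.1.27}$$

(the first equation in (5.1.9)) on $V^* = \{(x, y, z) \in \mathbb{R}^3 \mid (x, y) \neq (0, 0)\}$ (noting
Proposition 5.1), for $E_1 = E_1^{(n)}$:

$$\psi^0_{nlm}(g(r, \theta, \phi)) = r^l \exp\left(-\frac{\alpha_n r}{2}\right)\frac{[n - (l + 1)]!\; 2^{|m|}\, l!}{\sqrt{2\pi}\,(l + |m|)!}\, L^{(2l+1)}_{n-(l+1)}(\rho_n(r))\, e^{im\theta} P_l^{|m|}(\cos\phi) \tag{5.1.28}$$

for $(r, \theta, \phi) \in \mathbb{R}^+ \times \mathbb{R} \times (0, \pi)$, where $n, l, m$ are integers with $n \geq 1$, $0 \leq l \leq n - 1$,
$-l \leq m \leq l$, and where

$$E_1 = E_1^{(n)} \stackrel{\text{def}}{=} -\frac{2\pi^2\mu Z^2 e^4}{h^2 n^2} = -\frac{\mu Z^2 e^4}{2\hbar^2 n^2}, \tag{5.1.29}$$

$$\alpha_n = \frac{2\mu Z e^2}{\hbar^2 n}, \qquad \rho_n(r) = \alpha_n r \text{ for } r \in \mathbb{R}^+. \tag{5.1.30}$$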


The Laguerre and associated Legendre functions $L^{(2l+1)}_{n-(l+1)}$, $P_l^{|m|}$ in (5.1.28) are
defined in Section 4.2 of Chapter 4. We use the superscript 0 in $\psi^0_{nlm}$ as these
functions yet remain to be normalized. It is desirable to express $\psi^0_{nlm}(x, y, z)$
explicitly in terms of Cartesian coordinates $x, y, z$. Such an expression is rarely (if at
all) written down in physics books. Let $Y_l^m : \mathbb{R}^3 \to \mathbb{C}$ be the spherical harmonic
of degree $l$ and order $m$:

$$Y_l^m(g(r, \theta, \phi)) = r^l e^{im\theta}P_l^{|m|}(\cos\phi) = r^l Z_l^m(\theta, \phi) \tag{5.1.31}$$

on $\mathbb{R} \times \mathbb{R} \times (0, \pi)$; see Appendix 4A for details. If we write

$$|(x, y, z)| = \sqrt{x^2 + y^2 + z^2},$$

then by (5.1.28)

$$\psi^0_{nlm}(x, y, z) = \frac{[n - (l + 1)]!\; 2^{|m|}\, l!}{\sqrt{2\pi}\,(l + |m|)!}\exp\left(-\alpha_n|(x, y, z)|/2\right) L^{(2l+1)}_{n-(l+1)}\left(\alpha_n|(x, y, z)|\right) Y_l^m(x, y, z) \tag{5.1.32}$$

at least for $(x, y) \neq (0, 0)$ (by Proposition 5.1). We use the right-hand side of
(5.1.32) in fact to define $f_1$ $(= \psi^0_{nlm})$ on all of $\mathbb{R}^3$. Thus $f_1 : \mathbb{R}^3 \to \mathbb{C}$ is continuous,
and $f_1 : \mathbb{R}^3 - \{(0, 0, 0)\} \to \mathbb{C}$ is $C^\infty$, since $d : (x, y, z) \mapsto |(x, y, z)| =
+\sqrt{x^2 + y^2 + z^2}$ is $C^\infty$ on $\mathbb{R}^3 - \{(0, 0, 0)\}$. We therefore have that

$$\nabla^2 f_1 + \frac{8\pi^2\mu}{h^2}\left(E_1 + \frac{Ze^2}{d}\right)f_1$$

is a continuous function $\tau$ on $\mathbb{R}^3 - \{(0, 0, 0)\} \supset V^*$ which vanishes on $V^*$,
by (5.1.27). But then $\tau$ must vanish on $\mathbb{R}^3 - \{(0, 0, 0)\}$. Namely given $p =
(x, y, z) \in \mathbb{R}^3 - \{(0, 0, 0)\}$, we already have $\tau(p) = 0$ if $p \in V^*$, i.e., if $(x, y) \neq
(0, 0)$. If $(x, y) = (0, 0)$ then $z \neq 0$ and $p_n = (1/n, 1/n, z)$ $(n = 1, 2, 3, \ldots)$ $\in V^*$
such that $p_n \to p = (0, 0, z)$ on $\mathbb{R}^3 - \{(0, 0, 0)\}$ $\implies \tau(p) = \lim_{n \to \infty}\tau(p_n) = 0$,
as each $\tau(p_n) = 0$. Thus we have indeed constructed continuous functions $f_1 =
\psi^0_{nlm} : \mathbb{R}^3 \to \mathbb{C}$ which in fact are $C^\infty$ on $\mathbb{R}^3 - \{(0, 0, 0)\}$ and which on the latter
domain solve the first Schrödinger equation in (5.1.9). Because the approach here
was based on the general theory of hypergeometric equations, developed in Chapter 4,
the $\psi^0_{nlm}$ were obtained without recourse to the traditional method of power
series. Note that the quantum energies $E_1^{(n)}$ in (5.1.22), or in (5.1.29), are exactly
those advertised in (1.2.4), where the special case $Z = 1$ was considered.



5.2 Normalized Wave Functions 

The solutions $u_{nl}$ in (5.1.21) of the first equation in (5.1.16) satisfy a square-integrability
condition. First we extend the $u_{nl}$, defined on $(0, \infty)$, to continuous
functions on $[0, \infty)$. Namely, using (5.1.21), we set

$$u_{nl}(0) = \begin{cases} 0 & \text{if } l > 0, \\ (n - 1)!\,L^{(1)}_{n-1}(0) = -(n - 1)!\,\left(L^{(0)}_n\right)'(0) & \text{if } l = 0, \end{cases}$$

where the formula $\left(L^{(0)}_n\right)' = -L^{(1)}_{n-1}$, $n \geq 1$, is recalled.

Theorem 5.1.

$$\int_0^\infty u_{nl}(r)^2 r^2\,dr < \infty.$$

The value of this integral in fact is

$$\frac{2n\,[n - (l + 1)]!\,(n + l)!}{\alpha_n^{2l+3}}. \tag{5.2.1}$$



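Proof. Let

$$I \stackrel{\text{def}}{=} \int_0^\infty u_{nl}(r)^2 r^2\,dr = \left([n - (l + 1)]!\right)^2\int_0^\infty r^{2l}e^{-\alpha_n r}\left[L^{(2l+1)}_{n-(l+1)}(\alpha_n r)\right]^2 r^2\,dr = \frac{\left([n - (l + 1)]!\right)^2}{\alpha_n^{2l+3}}\int_0^\infty t^{2l+2}e^{-t}\left[L^{(2l+1)}_{n-(l+1)}(t)\right]^2 dt. \tag{5.2.2}$$

By Theorem 4.4, in particular,

$$\int_0^\infty t^{j+1}e^{-t}\left[L^{(j)}_m(t)\right]^2 dt = \frac{(2m + j + 1)(m + j)!}{m!} \tag{5.2.3}$$

for integers $m, j \geq 0$, which gives

$$I = \frac{\left([n - (l + 1)]!\right)^2}{\alpha_n^{2l+3}}\cdot\frac{(2n)(n + l)!}{[n - (l + 1)]!} = \frac{2n\,[n - (l + 1)]!\,(n + l)!}{\alpha_n^{2l+3}}, \tag{5.2.4}$$

as desired. □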
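The identity (5.2.3) and the value (5.2.1) can be checked exactly for small indices. The sketch below is only an illustration under stated assumptions: it uses the standard explicit sum $L^{(j)}_m(t) = \sum_{i=0}^m (-1)^i\binom{m+j}{m-i}\frac{t^i}{i!}$ for the Laguerre polynomials (assumed to agree with the normalization of (4.2.12)), together with $\int_0^\infty t^k e^{-t}\,dt = k!$. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial


def laguerre_coeffs(m, j):
    """Coefficients c_i with L_m^{(j)}(t) = sum_i c_i t^i (explicit-sum form)."""
    return [Fraction((-1) ** i * comb(m + j, m - i), factorial(i))
            for i in range(m + 1)]


def weighted_square_integral(m, j):
    """Exact value of int_0^inf t^(j+1) e^(-t) [L_m^{(j)}(t)]^2 dt.

    Expands the square and uses int_0^inf t^k e^(-t) dt = k!.
    """
    c = laguerre_coeffs(m, j)
    total = Fraction(0)
    for a, ca in enumerate(c):
        for b, cb in enumerate(c):
            total += ca * cb * factorial(a + b + j + 1)
    return total


def scaled_radial_integral(n, l):
    """alpha_n^(2l+3) * int_0^inf u_nl(r)^2 r^2 dr, following (5.2.2)."""
    k = n - (l + 1)
    return factorial(k) ** 2 * weighted_square_integral(k, 2 * l + 1)


if __name__ == "__main__":
    # Compare with the right-hand side of (5.2.1), multiplied by alpha_n^(2l+3).
    for n in range(1, 5):
        for l in range(n):
            expected = 2 * n * factorial(n - l - 1) * factorial(n + l)
            print(n, l, scaled_radial_integral(n, l) == expected)
```

Exact rational arithmetic is used so that the comparison with (5.2.1) is an equality rather than a floating-point approximation.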



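Because of Theorem 5.1 we choose the following normalizations of the radial
solutions in (5.1.21):

$$R^{\pm}_{nl}(r) \stackrel{\text{def}}{=} \pm\left[\frac{\alpha_n^{2l+3}}{2n\,[n - (l + 1)]!\,(n + l)!}\right]^{1/2}u_{nl}(r). \tag{5.2.5}$$

Thus for $r > 0$

$$R^{\pm}_{nl}(r) = \pm\left[\frac{\alpha_n^{2l+3}\,[n - (l + 1)]!}{2n\,(n + l)!}\right]^{1/2} r^l\exp\left(-\frac{\alpha_n r}{2}\right)L^{(2l+1)}_{n-(l+1)}(\rho_n(r)), \qquad \int_0^\infty\left[R^{\pm}_{nl}(r)\right]^2 r^2\,dr = 1. \tag{5.2.6}$$

Write

$$\psi^0_{nlm}(g(r, \theta, \phi)) = \frac{2^{|m|}\,l!}{\sqrt{2\pi}\,(l + |m|)!}\,u_{nl}(r)\,Z_l^m(\theta, \phi) \tag{5.2.7}$$

on $\mathbb{R}^+ \times \mathbb{R} \times (0, \pi)$, by (5.1.21), (5.1.28), and (5.1.31). We apply Theorem 4.14
and Theorem 5.1 to compute

$$\begin{aligned}
\iiint_{\mathbb{R}^3}\left|\psi^0_{nlm}\right|^2 dx\,dy\,dz
&= \int_0^{2\pi}\!\!\int_0^{\pi}\!\!\int_0^\infty\frac{2^{2|m|}(l!)^2}{2\pi[(l + |m|)!]^2}\,u_{nl}(r)^2\,Z_l^m(\theta, \phi)\overline{Z_l^m(\theta, \phi)}\,r^2\sin\phi\;dr\,d\phi\,d\theta \\
&= \frac{2^{2|m|}(l!)^2}{2\pi[(l + |m|)!]^2}\cdot\frac{2n\,(n - l - 1)!\,(n + l)!}{\alpha_n^{2l+3}}\cdot\frac{4\pi(l + |m|)!}{(2l + 1)(l - |m|)!} \\
&= \frac{2^{2|m|}(l!)^2\,4n\,(n - l - 1)!\,(n + l)!}{\alpha_n^{2l+3}\,(l + |m|)!\,(l - |m|)!\,(2l + 1)}.
\end{aligned} \tag{5.2.8}$$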

That is, recalling that we have extended the $\psi^0_{nlm}$ continuously to all of $\mathbb{R}^3$, we have

Proposition 5.2. The wave functions $\psi^0_{nlm}$ are square-integrable and continuous
over $\mathbb{R}^3$ and thus define elements of the Hilbert space $L^2(\mathbb{R}^3, dx\,dy\,dz)$, where
$dx\,dy\,dz$ = Lebesgue measure on $\mathbb{R}^3$. Let

$$\psi_{nlm} \stackrel{\text{def}}{=} C_{nlm}\,\psi^0_{nlm} \quad\text{for}\quad C_{nlm} = \left[\frac{\alpha_n^{2l+3}\,(l + |m|)!\,(l - |m|)!\,(2l + 1)}{2^{2|m|}(l!)^2\,4n\,(n - l - 1)!\,(n + l)!}\right]^{1/2}. \tag{5.2.9}$$

Then

$$\iiint_{\mathbb{R}^3}\left|\psi_{nlm}(x, y, z)\right|^2 dx\,dy\,dz = 1. \tag{5.2.10}$$



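The normalized wave functions $\psi_{nlm}$ of Proposition 5.2 can be explicated as
follows. For integers $l, m$ with $l \geq 0$, $|m| \leq l$ let

$$c_{lm} \stackrel{\text{def}}{=} \left[\frac{(2l + 1)(l - |m|)!}{2(l + |m|)!}\right]^{1/2}. \tag{5.2.11}$$

Then by (5.1.32), (5.2.9), again for $n \geq 1$, $0 \leq l \leq n - 1$, $-l \leq m \leq l$, we can
write

$$\psi_{nlm}(x, y, z) = c_{lm}\frac{1}{\sqrt{2\pi}}\left[\frac{\alpha_n^{2l+3}\,[n - (l + 1)]!}{(n + l)!\,2n}\right]^{1/2}\exp\left(-\alpha_n|(x, y, z)|/2\right)L^{(2l+1)}_{n-(l+1)}\left(\alpha_n|(x, y, z)|\right)Y_l^m(x, y, z). \tag{5.2.12}$$

Note that for an integer $k \geq 0$ and $x \in \mathbb{R}$

$$\begin{aligned}
L_k(x) &\stackrel{\text{def}}{=} e^x\frac{d^k}{dx^k}\left(e^{-x}x^k\right) = k!\,L_k^{(0)}(x), \\
L_k^j(x) &\stackrel{\text{def}}{=} \frac{d^j}{dx^j}L_k(x) = (-1)^j k!\,L^{(j)}_{k-j}(x) \quad\text{for } 0 \leq j \leq k
\end{aligned} \tag{5.2.13}$$

$$\implies L^{2l+1}_{n+l}(x) = -(n + l)!\,L^{(2l+1)}_{n-l-1}(x);$$

see Appendix 4B. Therefore for

$$C^{\pm}_{nl} \stackrel{\text{def}}{=} \pm\left[\frac{\alpha_n^3\,(n - l - 1)!}{2n\,[(n + l)!]^3}\right]^{1/2}, \tag{5.2.14}$$

equation (5.2.12) can also be expressed as

$$\psi_{nlm}(x, y, z) = c_{lm}\,C^{-}_{nl}\,\frac{1}{\sqrt{2\pi}}\,\alpha_n^l\exp\left(-\alpha_n|(x, y, z)|/2\right)L^{2l+1}_{n+l}\left(\alpha_n|(x, y, z)|\right)Y_l^m(x, y, z) \tag{5.2.15}$$

on $\mathbb{R}^3$.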



5.3 The Wave Functions for Small n 



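In Chapter 0 we introduced the constant

$$a_0 = \frac{h^2}{4\pi^2\mu e^2} = \frac{\hbar^2}{\mu e^2} = 0.52946 \times 10^{-8}\ \text{cm}; \tag{5.3.1}$$

see (0.0.11), (0.0.12). Historically $a_0$ is Bohr's radius, the radius of the smallest
possible orbit of an electron about the hydrogen nucleus in the old, or earlier
version of quantum theory. By (5.1.30)

$$\alpha_n \stackrel{\text{def}}{=} \frac{2\mu Z e^2}{n\hbar^2} = \frac{2Z}{n a_0}, \qquad \rho_n(r) \stackrel{\text{def}}{=} \alpha_n r = \frac{2Zr}{n a_0}. \tag{5.3.2}$$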
Note that $g(x, 0, \pi/2) = (x, 0, 0)$, or $Y_l^m(x, 0, 0) = Y_l^m(g(x, 0, \pi/2)) =
x^l Z_l^m(0, \pi/2) = x^l e^{im\cdot 0}P_l^{|m|}(\cos(\pi/2)) = x^l P_l^{|m|}(0)$; see (5.1.31).

In particular for $n = 1$, $l = m = 0$: $Y_0^0(x, 0, 0) = 1$, $c_{00} = 1/\sqrt{2}$, the bracketed factor
in (5.2.12) equals $\sqrt{\alpha_1^3/2}$, $\alpha_1 = \frac{2Z}{a_0}$, and $L_0^{(1)}(x) = 1$. By (5.2.12)

$$\psi_{100}(x, 0, 0) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\exp\left(-\frac{Z}{a_0}|x|\right)}{\sqrt{\pi}} \tag{5.3.3}$$



which as a function of x has a cusp at the origin: see Figure 5.2, where c = 
In particular $\frac{\partial\psi_{100}}{\partial x}(0, 0, 0)$ does not exist.




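Equations (5.2.5), (5.2.6), (5.3.2) imply that the normalized radial wave functions
$R^+_{nl}(r)$, $r > 0$, are given by

$$R^+_{nl}(r) = \frac{2}{n^2}\left(\frac{Z}{a_0}\right)^{3/2}\sqrt{\frac{(n - l - 1)!}{(n + l)!}}\;\rho_n(r)^l\exp\left(-\rho_n(r)/2\right)L^{(2l+1)}_{n-l-1}(\rho_n(r)); \tag{5.3.4}$$

$$\int_0^\infty R^+_{nl}(r)^2 r^2\,dr = 1. \tag{5.3.5}$$

For example,

$$\begin{aligned}
R^+_{10}(r) &= 2\left(\frac{Z}{a_0}\right)^{3/2}e^{-\rho/2}, \\
R^+_{20}(r) &= \frac{1}{2\sqrt{2}}\left(\frac{Z}{a_0}\right)^{3/2}e^{-\rho/2}(2 - \rho), \\
R^+_{21}(r) &= \frac{1}{2\sqrt{6}}\left(\frac{Z}{a_0}\right)^{3/2}\rho\,e^{-\rho/2}, \\
R^+_{30}(r) &= \frac{1}{9\sqrt{3}}\left(\frac{Z}{a_0}\right)^{3/2}e^{-\rho/2}(6 - 6\rho + \rho^2), \\
R^+_{31}(r) &= \frac{1}{9\sqrt{6}}\left(\frac{Z}{a_0}\right)^{3/2}\rho\,e^{-\rho/2}(4 - \rho), \\
R^+_{32}(r) &= \frac{1}{9\sqrt{30}}\left(\frac{Z}{a_0}\right)^{3/2}\rho^2 e^{-\rho/2},
\end{aligned} \tag{5.3.6}$$

where we write

$$\rho = \rho_n(r) = \frac{2Zr}{n a_0}. \tag{5.3.7}$$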
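The closed forms in (5.3.6) can be compared numerically with the general expression (5.3.4). The following is a minimal sketch under stated assumptions: lengths are measured in units of $a_0$ by default, the Laguerre polynomial is evaluated from its standard explicit sum (assumed to match (4.2.12)), and the function names are hypothetical.

```python
from math import comb, exp, factorial, sqrt


def laguerre(k, a, x):
    """Generalized Laguerre polynomial L_k^{(a)}(x) via its explicit sum."""
    return sum((-1) ** i * comb(k + a, k - i) * x ** i / factorial(i)
               for i in range(k + 1))


def radial(n, l, r, Z=1, a0=1.0):
    """Normalized radial function R^+_{nl}(r) following (5.3.4)."""
    if n < 1 or not 0 <= l <= n - 1:
        raise ValueError("need n >= 1 and 0 <= l <= n - 1")
    rho = 2.0 * Z * r / (n * a0)
    prefactor = (2.0 / n ** 2) * (Z / a0) ** 1.5 * sqrt(
        factorial(n - l - 1) / factorial(n + l))
    return prefactor * rho ** l * exp(-rho / 2.0) * laguerre(n - l - 1, 2 * l + 1, rho)


def radial_31_closed_form(r, Z=1, a0=1.0):
    """The entry R^+_{31} of (5.3.6), written out explicitly."""
    rho = 2.0 * Z * r / (3.0 * a0)
    return (Z / a0) ** 1.5 * rho * (4.0 - rho) * exp(-rho / 2.0) / (9.0 * sqrt(6.0))


if __name__ == "__main__":
    for r in (0.5, 1.7, 6.0):
        print(r, radial(3, 1, r), radial_31_closed_form(r))
```

The two columns printed for each radius should agree to rounding error, which is a quick consistency check on the table entry.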



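There are also normalized functions $\Phi_m$, $X_{lm}$ on $\mathbb{R}$, $[0, \pi]$ given by

$$\Phi_m(\theta) \stackrel{\text{def}}{=} \frac{e^{im\theta}}{\sqrt{2\pi}}: \quad \int_0^{2\pi}|\Phi_m(\theta)|^2\,d\theta = 1, \qquad X_{lm}(\phi) \stackrel{\text{def}}{=} c_{lm}P_l^{|m|}(\cos\phi): \quad \int_0^{\pi}X_{lm}(\phi)^2\sin\phi\,d\phi = 1, \tag{5.3.8}$$

according to Theorem 4.8, given $|m| \leq l$. Again writing $f_1 \otimes f_2$ for the function
$(x, y) \mapsto f_1(x)f_2(y)$ we can assert that

$$\psi_{nlm} \circ g = R^+_{nl} \otimes \Phi_m \otimes X_{lm} \tag{5.3.9}$$

on $\{(r, \theta, \phi) \in \mathbb{R}^3 \mid r > 0,\ 0 < \phi < \pi\}$, by the various definitions involved. The
normalized wave function $\psi_{100}$ is computed at the special point $g(r, 0, \pi/2) =
(r, 0, 0)$ in (5.3.3). We consider further computations of $\psi_{nlm}$ at points $g(r, \theta, \phi)$
for small values of $n, l, m$: $1 \leq n \leq 3$; thus $0 \leq l \leq 2$, $-2 \leq m \leq 2$. In what
follows we shall write $\psi_{nlm}$ for $\psi_{nlm} \circ g$; i.e., for convenience, $\psi_{nlm}$ will mean
$R^+_{nl} \otimes \Phi_m \otimes X_{lm}$.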

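We have

$$\begin{aligned}
X_{00}(\phi) &= \frac{\sqrt{2}}{2}, \\
X_{10}(\phi) &= \frac{\sqrt{6}}{2}\cos(\phi), \\
X_{1\pm 1}(\phi) &= \frac{\sqrt{3}}{2}\sin(\phi), \\
X_{20}(\phi) &= \frac{\sqrt{10}}{4}\left(3\cos^2(\phi) - 1\right), \\
X_{2\pm 1}(\phi) &= \frac{\sqrt{15}}{2}\sin(\phi)\cos(\phi), \\
X_{2\pm 2}(\phi) &= \frac{\sqrt{15}}{4}\sin^2(\phi), \\
X_{30}(\phi) &= \frac{3\sqrt{14}}{4}\left(\frac{5}{3}\cos^3(\phi) - \cos(\phi)\right), \\
X_{3\pm 1}(\phi) &= \frac{\sqrt{42}}{8}\sin(\phi)\left(5\cos^2(\phi) - 1\right)
\end{aligned} \tag{5.3.10}$$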
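for $0 \leq \phi \leq \pi$. Now

$$\begin{aligned}
\psi_{100} &= R^+_{10} \otimes \Phi_0 \otimes X_{00}, \\
\psi_{200} &= R^+_{20} \otimes \Phi_0 \otimes X_{00}, \\
\psi_{210} &= R^+_{21} \otimes \Phi_0 \otimes X_{10}, \\
\psi_{21\pm 1} &= R^+_{21} \otimes \Phi_{\pm 1} \otimes X_{1\pm 1}, \\
\psi_{300} &= R^+_{30} \otimes \Phi_0 \otimes X_{00}, \\
\psi_{310} &= R^+_{31} \otimes \Phi_0 \otimes X_{10}, \\
\psi_{31\pm 1} &= R^+_{31} \otimes \Phi_{\pm 1} \otimes X_{1\pm 1}, \\
\psi_{320} &= R^+_{32} \otimes \Phi_0 \otimes X_{20}, \\
\psi_{32\pm 1} &= R^+_{32} \otimes \Phi_{\pm 1} \otimes X_{2\pm 1}, \\
\psi_{32\pm 2} &= R^+_{32} \otimes \Phi_{\pm 2} \otimes X_{2\pm 2}.
\end{aligned} \tag{5.3.11}$$

Therefore,

$$\begin{aligned}
\psi_{100}(r, \theta, \phi) &= 2\left(\frac{Z}{a_0}\right)^{3/2}e^{-\rho/2}\frac{1}{\sqrt{2\pi}}\frac{\sqrt{2}}{2} = \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{\sqrt{\pi}}e^{-(Z/a_0)r}, \\
\psi_{200}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{2\sqrt{2}}e^{-\rho/2}(2 - \rho)\frac{1}{\sqrt{2\pi}}\frac{\sqrt{2}}{2} = \left(\frac{Z}{a_0}\right)^{3/2}\frac{e^{-\rho/2}(2 - \rho)}{4\sqrt{2\pi}}, \\
\psi_{210}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{2\sqrt{6}}\rho e^{-\rho/2}\frac{1}{\sqrt{2\pi}}\frac{\sqrt{6}}{2}\cos(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}}{4\sqrt{2\pi}}\cos(\phi), \\
\psi_{21\pm 1}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{2\sqrt{6}}\rho e^{-\rho/2}\frac{e^{\pm i\theta}}{\sqrt{2\pi}}\frac{\sqrt{3}}{2}\sin(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}e^{\pm i\theta}}{8\sqrt{\pi}}\sin(\phi), \\
\psi_{300}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{9\sqrt{3}}e^{-\rho/2}(6 - 6\rho + \rho^2)\frac{1}{\sqrt{2\pi}}\frac{\sqrt{2}}{2} = \left(\frac{Z}{a_0}\right)^{3/2}\frac{e^{-\rho/2}(6 - 6\rho + \rho^2)}{18\sqrt{3\pi}}, \\
\psi_{310}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{9\sqrt{6}}\rho e^{-\rho/2}(4 - \rho)\frac{1}{\sqrt{2\pi}}\frac{\sqrt{6}}{2}\cos(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}(4 - \rho)}{18\sqrt{2\pi}}\cos(\phi),
\end{aligned} \tag{5.3.12}$$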
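$$\begin{aligned}
\psi_{31\pm 1}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{9\sqrt{6}}\rho e^{-\rho/2}(4 - \rho)\frac{e^{\pm i\theta}}{\sqrt{2\pi}}\frac{\sqrt{3}}{2}\sin(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}(4 - \rho)e^{\pm i\theta}}{36\sqrt{\pi}}\sin(\phi), \\
\psi_{320}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}}{9\sqrt{30}}\frac{1}{\sqrt{2\pi}}\frac{\sqrt{10}}{4}\left(3\cos^2(\phi) - 1\right) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}}{36\sqrt{6\pi}}\left(3\cos^2(\phi) - 1\right), \\
\psi_{32\pm 1}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}}{9\sqrt{30}}\frac{e^{\pm i\theta}}{\sqrt{2\pi}}\frac{\sqrt{15}}{2}\sin(\phi)\cos(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}e^{\pm i\theta}}{36\sqrt{\pi}}\sin(\phi)\cos(\phi), \\
\psi_{32\pm 2}(r, \theta, \phi) &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}}{9\sqrt{30}}\frac{e^{\pm 2i\theta}}{\sqrt{2\pi}}\frac{\sqrt{15}}{4}\sin^2(\phi) = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}e^{\pm 2i\theta}}{72\sqrt{\pi}}\sin^2(\phi).
\end{aligned} \tag{5.3.13}$$

Note that

$$\frac{1}{\sqrt{2}}\left[\psi_{211}(r, \theta, \phi) + \psi_{21(-1)}(r, \theta, \phi)\right] = \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}}{8\sqrt{\pi}}\frac{1}{\sqrt{2}}\sin(\phi)\left[e^{i\theta} + e^{-i\theta}\right] = \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{4\sqrt{2\pi}}\rho e^{-\rho/2}\sin(\phi)\cos(\theta), \tag{5.3.14}$$

$$\frac{-i}{\sqrt{2}}\left[\psi_{211}(r, \theta, \phi) - \psi_{21(-1)}(r, \theta, \phi)\right] = \left(\frac{Z}{a_0}\right)^{3/2}\frac{1}{4\sqrt{2\pi}}\rho e^{-\rho/2}\sin(\phi)\sin(\theta).$$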

\ao/ 4V27T 



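Similarly,

$$\begin{aligned}
\frac{1}{\sqrt{2}}\left[\psi_{311}(r, \theta, \phi) + \psi_{31(-1)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}(4 - \rho)\sin(\phi)\cos(\theta)}{18\sqrt{2\pi}}, \\
\frac{-i}{\sqrt{2}}\left[\psi_{311}(r, \theta, \phi) - \psi_{31(-1)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho e^{-\rho/2}(4 - \rho)\sin(\phi)\sin(\theta)}{18\sqrt{2\pi}}, \\
\frac{1}{\sqrt{2}}\left[\psi_{321}(r, \theta, \phi) + \psi_{32(-1)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}\sin(\phi)\cos(\phi)\cos(\theta)}{18\sqrt{2\pi}}, \\
\frac{-i}{\sqrt{2}}\left[\psi_{321}(r, \theta, \phi) - \psi_{32(-1)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{\rho^2 e^{-\rho/2}\sin(\phi)\cos(\phi)\sin(\theta)}{18\sqrt{2\pi}}, \\
\frac{1}{\sqrt{2}}\left[\psi_{322}(r, \theta, \phi) + \psi_{32(-2)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{2\rho^2 e^{-\rho/2}\sin^2(\phi)\cos(2\theta)}{72\sqrt{2\pi}}, \\
\frac{-i}{\sqrt{2}}\left[\psi_{322}(r, \theta, \phi) - \psi_{32(-2)}(r, \theta, \phi)\right] &= \left(\frac{Z}{a_0}\right)^{3/2}\frac{2\rho^2 e^{-\rho/2}\sin^2(\phi)\sin(2\theta)}{72\sqrt{2\pi}}.
\end{aligned}$$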

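In general,

$$\frac{1}{\sqrt{2}}\left[\psi_{nlm}(r, \theta, \phi) + \psi_{nl(-m)}(r, \theta, \phi)\right] = R^+_{nl}(r)\frac{e^{im\theta}}{\sqrt{2}\sqrt{2\pi}}X_{lm}(\phi) + R^+_{nl}(r)\frac{e^{-im\theta}}{\sqrt{2}\sqrt{2\pi}}X_{l(-m)}(\phi) = \frac{1}{\sqrt{\pi}}R^+_{nl}(r)X_{lm}(\phi)\cos(m\theta),$$

as $X_{l(-m)} = X_{lm}$. Similarly

$$\frac{-i}{\sqrt{2}}\left[\psi_{nlm}(r, \theta, \phi) - \psi_{nl(-m)}(r, \theta, \phi)\right] = \frac{-i}{\sqrt{2}}R^+_{nl}(r)X_{lm}(\phi)\frac{e^{im\theta} - e^{-im\theta}}{\sqrt{2\pi}} = \frac{1}{\sqrt{\pi}}R^+_{nl}(r)X_{lm}(\phi)\sin(m\theta).$$

In particular (for $m \neq 0$)

$$\begin{aligned}
\int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^{\pi}\left(\frac{1}{\sqrt{2}}\left[\psi_{nlm}(r, \theta, \phi) + \psi_{nl(-m)}(r, \theta, \phi)\right]\right)^2 r^2\sin(\phi)\,d\phi\,d\theta\,dr
&= \frac{1}{\pi}\int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^{\pi}R^+_{nl}(r)^2 X_{lm}(\phi)^2\cos^2(m\theta)\,r^2\sin(\phi)\,d\phi\,d\theta\,dr \\
&= \frac{1}{\pi}\int_0^{2\pi}\cos^2(m\theta)\,d\theta = 1, \quad\text{by (5.3.5), (5.3.8).}
\end{aligned}$$

Similarly

$$\frac{-i}{\sqrt{2}}\left[\psi_{nlm}(r, \theta, \phi) - \psi_{nl(-m)}(r, \theta, \phi)\right]$$

is normalized, as

$$\frac{1}{\pi}\int_0^{2\pi}\sin^2(m\theta)\,d\theta = 1. \tag{5.3.20}$$

[Figure 5.3: Angular momentum]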

5.4 Quantization of Angular Momentum 

For $n = 1, 2, 3, \ldots$, $l = 0, 1, 2, \ldots, n - 1$, $m = -l, -l + 1, \ldots, -1, 0, 1, 2, \ldots, l$ there
corresponds the normalized wave function $\psi_{nlm}$ of (5.2.12), or (5.2.15). As in the
introductory remarks of this chapter, for the quantum data $(n, l, m)$, $n$ is called the
total quantum number, $l$ is called the azimuthal or orbital quantum number, and
$m$ is called the magnetic quantum number. Note that the energy $E_1^{(n)}$
in (5.1.22) of the quantum system described by $\psi_{nlm}$ depends only on the total
quantum number $n$. The azimuthal quantum number $l$ determines the magnitude
$\hbar\sqrt{l(l + 1)}$ of the quantized angular momentum $M$ of the electron with wave
function $\psi_{nlm}$, a statement to which we now proceed to attach a clear meaning. The
magnetic quantum number $m$ will prescribe, for example, the magnitude $\hbar|m|$ (as
it will turn out) of the $z$-component of $M$ and the spatial quantization of $M$ given
by $\cos\phi$ where $\phi$ is the "angle between $M$ and the $z$-axis."

The classical angular momentum $M$ of a particle of mass $m$ is defined by $M =
r \times p$ where $r = (x, y, z)$ is its position vector and $p = (p_x, p_y, p_z) = m\frac{dr}{dt}$ is its
linear momentum vector (cf. Figure 5.3):

$$M = \begin{vmatrix} i & j & k \\ x & y & z \\ p_x & p_y & p_z \end{vmatrix} = (M_x, M_y, M_z), \tag{5.4.1}$$

where

$$M_x = yp_z - zp_y, \qquad M_y = zp_x - xp_z, \qquad M_z = xp_y - yp_x. \tag{5.4.2}$$

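The quantization rules

$$x, y, z \to \text{multiplication by } x, y, z, \qquad p_x, p_y, p_z \to \frac{\hbar}{i}\frac{\partial}{\partial x},\ \frac{\hbar}{i}\frac{\partial}{\partial y},\ \frac{\hbar}{i}\frac{\partial}{\partial z} \tag{5.4.3}$$

of Chapter 3 therefore lead to the quantization rules

$$M_x \to \frac{\hbar}{i}\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right), \qquad M_y \to \frac{\hbar}{i}\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right), \qquad M_z \to \frac{\hbar}{i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right). \tag{5.4.4}$$

Let $M^2 = M_x^2 + M_y^2 + M_z^2$, which is the differential operator which represents
the length$^2$ $= |r \times p|^2$. We will show that on $\mathbb{R}^3 - \{0\}$,

$$M^2\psi_{nlm} = \hbar^2 l(l + 1)\psi_{nlm}, \qquad M_z\psi_{nlm} = \hbar m\,\psi_{nlm}. \tag{5.4.5}$$

Given (5.4.5), we therefore take $\hbar\sqrt{l(l + 1)}$ as the magnitude of the (quantum)
angular momentum $M$, and we take $\hbar|m|$ as the magnitude of the $z$-component of
$M$; see (3.9.1). Graphically one has the following (Figure 5.4):

$$\cos\phi = \frac{\hbar m}{\hbar\sqrt{l(l + 1)}} = \frac{m}{\sqrt{l(l + 1)}}. \tag{5.4.6}$$

[Figure 5.4: Quantum angular momentum. The vector $M$ has length $\hbar\sqrt{l(l + 1)}$ and $z$-component $\hbar m$.]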
Since the inequality $|m| \leq l$ restricts $m$ to taking on only the $2l + 1$ values
$-l, -l + 1, \ldots, 0, 1, \ldots, l$, the "vector" $M$ can assume $2l + 1$ distinct orientations
only, with the angle of orientation $\phi$ given by (5.4.6) as illustrated in Figure 5.4.
For example, take $l = 2$: $m = -2, -1, 0, 1, 2$, so that $l(l + 1) = 6 \implies \cos\phi =
0, \pm\frac{1}{\sqrt{6}}, \pm\frac{2}{\sqrt{6}}$. The following graphic (Figure 5.5 for $l = 2$) therefore illustrates
the spatial quantization of angular momentum.

[Figure 5.5: Spatial quantization of angular momentum]

The five possible orientations for $M$ are shown. $\cos\phi = \frac{2}{\sqrt{6}}, \frac{1}{\sqrt{6}} \implies \phi \approx
35^\circ, 66^\circ$ respectively. The corresponding magnitudes for $M_z$ are (for $m = -2, -1,
0, 1, 2$) $2\hbar, \hbar, 0, \hbar, 2\hbar$. For $M^2 = M_x^2 + M_y^2 + M_z^2$ with $M_x, M_y, M_z$ given by (5.4.4)
we now establish that indeed (5.4.5) holds.
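The orientation angles allowed by (5.4.6) are easy to tabulate. The sketch below is only an illustration; the function name `orientation_angles` is hypothetical, and the case $l = 0$ is excluded because $\sqrt{l(l + 1)} = 0$ leaves the angle undefined.

```python
from math import acos, degrees, sqrt


def orientation_angles(l):
    """Map each m in -l..l to the angle (in degrees) with cos(phi) = m / sqrt(l(l+1))."""
    if l < 1:
        raise ValueError("l must be at least 1 for the angle to be defined")
    norm = sqrt(l * (l + 1))
    return {m: degrees(acos(m / norm)) for m in range(-l, l + 1)}


if __name__ == "__main__":
    for m, angle in orientation_angles(2).items():
        print(f"m = {m:+d}: phi = {angle:.2f} degrees")
```

For $l = 2$ this lists five angles, symmetric about $90^\circ$, matching the five orientations described above.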



Theorem 5.2. $M^2\psi_{nlm} = \hbar^2 l(l + 1)\psi_{nlm}$ on $\mathbb{R}^3 - \{0\}$.



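It is certainly not obvious that $M^2$ should act as a scalar on the wave function
$\psi_{nlm}$.

Let $\psi : U \subset \mathbb{R}^3 \to \mathbb{C}$ be a function with continuous first and second partial
derivatives on the open set $U$. In particular,

$$\frac{\partial^2\psi}{\partial y\,\partial z} = \frac{\partial^2\psi}{\partial z\,\partial y}, \tag{5.4.7}$$

$$\frac{\partial^2\psi}{\partial z\,\partial x} = \frac{\partial^2\psi}{\partial x\,\partial z}, \tag{5.4.8}$$

$$\frac{\partial^2\psi}{\partial x\,\partial y} = \frac{\partial^2\psi}{\partial y\,\partial x}. \tag{5.4.9}$$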
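$$M_x^2\psi = -\hbar^2\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right)\left(y\frac{\partial\psi}{\partial z} - z\frac{\partial\psi}{\partial y}\right) \implies$$

$$\begin{aligned}
\frac{M_x^2\psi}{-\hbar^2} &= y\frac{\partial}{\partial z}\left(y\frac{\partial\psi}{\partial z} - z\frac{\partial\psi}{\partial y}\right) - z\frac{\partial}{\partial y}\left(y\frac{\partial\psi}{\partial z} - z\frac{\partial\psi}{\partial y}\right) \\
&= y^2\frac{\partial^2\psi}{\partial z^2} - y\frac{\partial\psi}{\partial y} - yz\frac{\partial^2\psi}{\partial z\,\partial y} - z\frac{\partial\psi}{\partial z} - zy\frac{\partial^2\psi}{\partial y\,\partial z} + z^2\frac{\partial^2\psi}{\partial y^2} \\
&= y^2\frac{\partial^2\psi}{\partial z^2} - 2yz\frac{\partial^2\psi}{\partial y\,\partial z} - y\frac{\partial\psi}{\partial y} - z\frac{\partial\psi}{\partial z} + z^2\frac{\partial^2\psi}{\partial y^2}
\end{aligned}$$

by equation (5.4.7). Similarly, permuting $x, y, z$ cyclically, one has

$$\frac{M_y^2\psi}{-\hbar^2} = z^2\frac{\partial^2\psi}{\partial x^2} - 2zx\frac{\partial^2\psi}{\partial z\,\partial x} - z\frac{\partial\psi}{\partial z} - x\frac{\partial\psi}{\partial x} + x^2\frac{\partial^2\psi}{\partial z^2}, \tag{5.4.12}$$

$$\frac{M_z^2\psi}{-\hbar^2} = x^2\frac{\partial^2\psi}{\partial y^2} - 2xy\frac{\partial^2\psi}{\partial x\,\partial y} - x\frac{\partial\psi}{\partial x} - y\frac{\partial\psi}{\partial y} + y^2\frac{\partial^2\psi}{\partial x^2}, \tag{5.4.13}$$

as equations (5.4.8), (5.4.9) apply.

Adding, one gets

$$\begin{aligned}
\frac{M^2\psi}{-\hbar^2} &= \left(y^2 + z^2\right)\frac{\partial^2\psi}{\partial x^2} + \left(z^2 + x^2\right)\frac{\partial^2\psi}{\partial y^2} + \left(x^2 + y^2\right)\frac{\partial^2\psi}{\partial z^2} - 2yz\frac{\partial^2\psi}{\partial y\,\partial z} - 2zx\frac{\partial^2\psi}{\partial z\,\partial x} - 2xy\frac{\partial^2\psi}{\partial x\,\partial y} - 2\left[x\frac{\partial}{\partial x} + y\frac{\partial}{\partial y} + z\frac{\partial}{\partial z}\right]\psi \\
&= \left(r^2 - x^2\right)\frac{\partial^2\psi}{\partial x^2} + \left(r^2 - y^2\right)\frac{\partial^2\psi}{\partial y^2} + \left(r^2 - z^2\right)\frac{\partial^2\psi}{\partial z^2} - 2yz\frac{\partial^2\psi}{\partial y\,\partial z} - 2zx\frac{\partial^2\psi}{\partial z\,\partial x} - 2xy\frac{\partial^2\psi}{\partial x\,\partial y} - 2\left[x\frac{\partial}{\partial x} + y\frac{\partial}{\partial y} + z\frac{\partial}{\partial z}\right]\psi
\end{aligned} \tag{5.4.14}$$

for $r^2 = x^2 + y^2 + z^2$.

For $\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$ and $E = x\frac{\partial}{\partial x} + y\frac{\partial}{\partial y} + z\frac{\partial}{\partial z}$ (= Euler's differential
operator) we can write (5.4.14) as

$$\frac{M^2\psi}{-\hbar^2} = r^2\nabla^2\psi - x^2\frac{\partial^2\psi}{\partial x^2} - y^2\frac{\partial^2\psi}{\partial y^2} - z^2\frac{\partial^2\psi}{\partial z^2} - 2yz\frac{\partial^2\psi}{\partial y\,\partial z} - 2zx\frac{\partial^2\psi}{\partial z\,\partial x} - 2xy\frac{\partial^2\psi}{\partial x\,\partial y} - 2E\psi. \tag{5.4.15}$$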



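On the other hand note that

$$\begin{aligned}
E^2\psi = E(E\psi) &= x\frac{\partial}{\partial x}\left(x\frac{\partial\psi}{\partial x} + y\frac{\partial\psi}{\partial y} + z\frac{\partial\psi}{\partial z}\right) + y\frac{\partial}{\partial y}\left(x\frac{\partial\psi}{\partial x} + y\frac{\partial\psi}{\partial y} + z\frac{\partial\psi}{\partial z}\right) + z\frac{\partial}{\partial z}\left(x\frac{\partial\psi}{\partial x} + y\frac{\partial\psi}{\partial y} + z\frac{\partial\psi}{\partial z}\right) \\
&= x^2\frac{\partial^2\psi}{\partial x^2} + y^2\frac{\partial^2\psi}{\partial y^2} + z^2\frac{\partial^2\psi}{\partial z^2} + 2xy\frac{\partial^2\psi}{\partial x\,\partial y} + 2xz\frac{\partial^2\psi}{\partial x\,\partial z} + 2yz\frac{\partial^2\psi}{\partial y\,\partial z} + E\psi
\end{aligned}$$

by equations (5.4.7), (5.4.8), (5.4.9) again. By equation (5.4.15), then,

$$\frac{M^2\psi}{-\hbar^2} - r^2\nabla^2\psi + E\psi = -E\psi - x^2\frac{\partial^2\psi}{\partial x^2} - y^2\frac{\partial^2\psi}{\partial y^2} - z^2\frac{\partial^2\psi}{\partial z^2} - 2yz\frac{\partial^2\psi}{\partial y\,\partial z} - 2zx\frac{\partial^2\psi}{\partial z\,\partial x} - 2xy\frac{\partial^2\psi}{\partial x\,\partial y} = -E^2\psi.$$

Proposition 5.3.

$$\frac{M^2\psi}{\hbar^2} = -r^2\nabla^2\psi + E^2\psi + E\psi$$

for

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}, \qquad \psi \in C^2(U),\ U^{\text{open}} \subset \mathbb{R}^3, \qquad E = x\frac{\partial}{\partial x} + y\frac{\partial}{\partial y} + z\frac{\partial}{\partial z}, \qquad r^2 = x^2 + y^2 + z^2.$$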
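As a quick sanity check of Proposition 5.3, here is a worked example (not from the text) for the simplest non-constant case, the linear function $\psi(x, y, z) = z$, which is the spherical harmonic $Y_1^0$ of (5.1.31) since $P_1^0(\cos\phi) = \cos\phi$:

```latex
\begin{align*}
\psi(x,y,z) &= z: \qquad \nabla^2\psi = 0, \qquad E\psi = z\,\frac{\partial z}{\partial z} = z = \psi, \qquad E^2\psi = E(E\psi) = \psi, \\
\frac{M^2\psi}{\hbar^2} &= -r^2\nabla^2\psi + E^2\psi + E\psi = 0 + \psi + \psi = 2\psi = l(l+1)\,\psi \quad\text{with } l = 1, \\
M_z\psi &= \frac{\hbar}{i}\left(x\frac{\partial z}{\partial y} - y\frac{\partial z}{\partial x}\right) = 0 = \hbar m\,\psi \quad\text{with } m = 0.
\end{align*}
```

The same computation works for any harmonic polynomial homogeneous of degree $l$: Euler's theorem gives $E\psi = l\psi$ and $E^2\psi = l^2\psi$, so the proposition yields $M^2\psi = \hbar^2(l^2 + l)\psi$.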



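Now, $\psi_{nlm} : \mathbb{R}^3 \to \mathbb{C}$ is given by equation (5.2.15) in Section 5.2:

$$\psi_{nlm}(x, y, z) = \alpha^l\exp\left(-\frac{\alpha}{2}r\right)L^{2l+1}_{n+l}(\alpha r)\;c\,\frac{c_{lm}}{\sqrt{2\pi}}\,Y_l^m(x, y, z)$$

for $c = C^-_{nl}$, $\alpha = \alpha_n$. $\phi = \psi_{nlm}$ is a $C^\infty$ function on $\mathbb{R}^3 - \{0\}$.
Let

$$C = \alpha^l c\,\frac{c_{lm}}{\sqrt{2\pi}}, \qquad H_1(x, y, z) = \exp\left(-\frac{\alpha}{2}r\right), \qquad H_2(x, y, z) = L^{2l+1}_{n+l}(\alpha r)\,Y_l^m(x, y, z) \tag{5.4.19}$$

so that $\phi = CH_1H_2$.
On $\mathbb{R}^3 - \{0\}$,

$$\frac{\partial H_1}{\partial x} = \exp\left(-\frac{\alpha}{2}r\right)\left(-\frac{\alpha}{2}\right)\frac{1}{2}\frac{2x}{\sqrt{x^2 + y^2 + z^2}} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{x}{r}. \tag{5.4.20}$$

Similarly

$$\frac{\partial H_1}{\partial y} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{y}{r}, \qquad \frac{\partial H_1}{\partial z} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{z}{r} \implies$$

$$EH_1 = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{x^2 + y^2 + z^2}{r} = -\frac{\alpha}{2}rH_1. \tag{5.4.22}$$

If $f_1, f_2$ are nice functions on an open set in $\mathbb{R}^3$ we clearly have $E(f_1f_2) =
(Ef_1)f_2 + f_1Ef_2$ since the product rule holds for $\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}$. In particular, on
$\mathbb{R}^3 - \{0\}$,

$$E^2H_1 = E(EH_1) = E\left(-\frac{\alpha}{2}rH_1\right)\ \text{(by (5.4.22))} = -\frac{\alpha}{2}\left[(Er)H_1 + rEH_1\right] \tag{5.4.23}$$

where

$$x\frac{\partial r}{\partial x} = \frac{x^2}{r}, \qquad y\frac{\partial r}{\partial y} = \frac{y^2}{r}, \qquad z\frac{\partial r}{\partial z} = \frac{z^2}{r}. \tag{5.4.24}$$

Therefore

$$Er = \frac{x^2 + y^2 + z^2}{r} = r, \tag{5.4.25}$$

$$E^2H_1 = -\frac{\alpha}{2}\left[rH_1 + rEH_1\right] = -\frac{\alpha}{2}\left[rH_1 + r\left(-\frac{\alpha}{2}\right)rH_1\right] = \left[-\frac{\alpha}{2}r + \frac{\alpha^2}{4}r^2\right]H_1. \tag{5.4.26}$$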
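Next we compute $EH_2$, $E^2H_2$.

If $H_3(x, y, z) = L^{2l+1}_{n+l}(\alpha r)$, $H_4 = Y_l^m$, then $H_2 = H_3H_4$ with $H_4$ homogeneous
of degree $l$ on $\mathbb{R}^3$:

$$H_4(tx, ty, tz) = t^l H_4(x, y, z) \implies \text{(by Euler's theorem)}\quad EH_4 = lH_4 \implies EH_2 = (EH_3)H_4 + H_3EH_4 = (EH_3)H_4 + lH_3H_4. \tag{5.4.27}$$

Now

$$x\frac{\partial H_3}{\partial x} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha x\frac{\partial r}{\partial x} \tag{5.4.28}$$

and similarly

$$y\frac{\partial H_3}{\partial y} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha y\frac{\partial r}{\partial y}, \qquad z\frac{\partial H_3}{\partial z} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha z\frac{\partial r}{\partial z} \implies \tag{5.4.29}$$

$$EH_3 = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha Er = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha r \quad\text{(by equation (5.4.25)).} \tag{5.4.30}$$

By the same argument (replace $L^{2l+1}_{n+l}(\alpha r)$ by $\left(L^{2l+1}_{n+l}\right)'(\alpha r)$)

$$E\left[\left(L^{2l+1}_{n+l}\right)'(\alpha r)\right] = \left(L^{2l+1}_{n+l}\right)''(\alpha r)\,\alpha r \implies \tag{5.4.31}$$

$$\begin{aligned}
E^2H_3 = E(EH_3) &= E\left(\left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha r\right) \\
&= \left(L^{2l+1}_{n+l}\right)''(\alpha r)\,\alpha r\,\alpha r + \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,E(\alpha r) \\
&= \alpha^2r^2\left(L^{2l+1}_{n+l}\right)''(\alpha r) + \alpha r\left(L^{2l+1}_{n+l}\right)'(\alpha r).
\end{aligned} \tag{5.4.32}$$

By equation (5.4.27),

$$EH_2 = (EH_3)H_4 + lH_3H_4 = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha rH_4 + lH_2,$$

and

$$\begin{aligned}
E^2H_2 &= (E^2H_3)H_4 + (EH_3)EH_4 + lEH_2 \\
&= \left[\alpha^2r^2\left(L^{2l+1}_{n+l}\right)''(\alpha r) + \alpha r\left(L^{2l+1}_{n+l}\right)'(\alpha r)\right]H_4 + \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha r\,lH_4 + l\left[\left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha rH_4 + lH_2\right].
\end{aligned} \tag{5.4.34}$$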
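By the formulas for $EH_1$, $E^2H_1$, $EH_2$, and $E^2H_2$, with $\phi/C = H_1H_2$ on $\mathbb{R}^3 - \{0\}$,

$$\begin{aligned}
\frac{E\phi}{C} &= (EH_1)H_2 + H_1EH_2 = \left(-\frac{\alpha}{2}rH_1\right)H_2 + H_1\left[\left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha rH_4 + lH_2\right] \\
&= -\frac{\alpha}{2}r\frac{\phi}{C} + l\frac{\phi}{C} + \alpha r\left(L^{2l+1}_{n+l}\right)'(\alpha r)H_1H_4,
\end{aligned} \tag{5.4.35}$$

$$\begin{aligned}
\frac{E^2\phi}{C} &= (E^2H_1)H_2 + (EH_1)EH_2 + (EH_1)EH_2 + H_1E^2H_2 \\
&= \left[-\frac{\alpha}{2}r + \frac{\alpha^2}{4}r^2\right]H_1H_2 + 2\left(-\frac{\alpha}{2}rH_1\right)\left(\left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha rH_4 + lH_2\right) \\
&\quad + H_1\left\{\left[\alpha^2r^2\left(L^{2l+1}_{n+l}\right)''(\alpha r) + \alpha r\left(L^{2l+1}_{n+l}\right)'(\alpha r)\right]H_4 + \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha r\,lH_4 + l\left[\left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha rH_4 + lH_2\right]\right\} \\
&= \left[-\frac{\alpha}{2}r + \frac{\alpha^2}{4}r^2 - \alpha rl + l^2\right]\frac{\phi}{C} + \left[-(\alpha r)^2 + \alpha r + 2\alpha rl\right]\left(L^{2l+1}_{n+l}\right)'(\alpha r)H_1H_4 + \alpha^2r^2\left(L^{2l+1}_{n+l}\right)''(\alpha r)H_1H_4 \implies
\end{aligned} \tag{5.4.36}$$

$$\begin{aligned}
\frac{(E + E^2)\phi}{C} &= \left[-\frac{\alpha}{2}r + l - \frac{\alpha}{2}r + \frac{\alpha^2}{4}r^2 - \alpha rl + l^2\right]\frac{\phi}{C} + \left[\alpha r - (\alpha r)^2 + \alpha r + 2\alpha rl\right]\left(L^{2l+1}_{n+l}\right)'(\alpha r)H_1H_4 + \alpha^2r^2\left(L^{2l+1}_{n+l}\right)''(\alpha r)H_1H_4 \\
&= \left[-\alpha r + l + \frac{\alpha^2r^2}{4} - \alpha rl + l^2\right]\frac{\phi}{C} + \alpha rH_1H_4\left[\alpha r\left(L^{2l+1}_{n+l}\right)''(\alpha r) + (2 + 2l - \alpha r)\left(L^{2l+1}_{n+l}\right)'(\alpha r)\right].
\end{aligned} \tag{5.4.37}$$

By the differential equation for $L^{2l+1}_{n+l}$ (see Appendix 4B)

$$x\left(L^{2l+1}_{n+l}\right)''(x) + (2l + 2 - x)\left(L^{2l+1}_{n+l}\right)'(x) + (n - l - 1)L^{2l+1}_{n+l}(x) = 0 \tag{5.4.38}$$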
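we obtain for $x = \alpha r$,

$$\frac{(E + E^2)\phi}{C} = \left[-\alpha r + l + \frac{\alpha^2r^2}{4} - \alpha rl + l^2\right]\frac{\phi}{C} + \alpha rH_1H_4\,(l + 1 - n)\,L^{2l+1}_{n+l}(\alpha r), \tag{5.4.39}$$

where $H_4L^{2l+1}_{n+l}(\alpha r) = H_4H_3 = H_2$. That is,

$$(E + E^2)\phi = \left[-\alpha r + l + \frac{\alpha^2r^2}{4} - \alpha rl + l^2 + \alpha r(l + 1 - n)\right]\phi \implies \tag{5.4.40}$$

$$(E + E^2)\psi_{nlm} = \left[l + \frac{\alpha^2r^2}{4} + l^2 - \alpha rn\right]\psi_{nlm} \tag{5.4.41}$$

on $\mathbb{R}^3 - \{0\}$!

On the other hand, on $\mathbb{R}^3 - \{0\}$, $\phi$ satisfies the Schrödinger equation (5.1.27):

$$\nabla^2\phi + \frac{2\mu}{\hbar^2}\left(E_1 - \frac{q_1q_2}{r}\right)\phi = 0, \quad\text{or}\quad r^2\nabla^2\phi = -\frac{2\mu}{\hbar^2}\left[r^2E_1 - q_1q_2\,r\right]\phi \tag{5.4.42}$$

for $q_2 = -e$, $q_1 = Ze$. By Proposition 5.3

$$\begin{aligned}
\frac{M^2\phi}{\hbar^2} &= -r^2\nabla^2\phi + (E + E^2)\phi \quad\text{(on } \mathbb{R}^3 - \{0\} \text{ where } \phi \text{ is } C^\infty\text{)} \\
&= \left[\frac{2\mu E_1}{\hbar^2}r^2 - \frac{2\mu q_1q_2}{\hbar^2}r + l + \frac{\alpha^2r^2}{4} + l^2 - \alpha rn\right]\phi
\end{aligned} \tag{5.4.43}$$

by equation (5.4.41). Here, by (5.1.22), (5.1.23),

$$E_1 = -\frac{\mu q_1^2q_2^2}{2n^2\hbar^2} \quad\text{and}\quad \alpha = \alpha_n = -\frac{2\mu q_1q_2}{n\hbar^2} \implies \left[\frac{2\mu E_1}{\hbar^2} + \frac{\alpha^2}{4}\right]r^2 = \left[-\frac{\mu^2q_1^2q_2^2}{n^2\hbar^4} + \frac{\mu^2q_1^2q_2^2}{n^2\hbar^4}\right]r^2 = 0.$$

Similarly

$$\left[-\frac{2\mu}{\hbar^2}q_1q_2 - \alpha n\right]r = 0 \cdot r = 0,$$

so that we finally arrive at the equation

$$\frac{M^2}{\hbar^2}\psi_{nlm} = \left(l + l^2\right)\psi_{nlm}, \quad\text{or}\quad M^2\psi_{nlm} = \hbar^2 l(l + 1)\psi_{nlm} \quad\text{on } \mathbb{R}^3 - \{0\}, \tag{5.4.48}$$

as desired. □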

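Maintaining the above notation, we have $H_4 = Y_l^m$. Recall the formula

$$\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right)Y_l^m = im\,Y_l^m$$

of Appendix 4A. Hence by (5.4.4)

Proposition 5.4. $M_zY_l^m = \hbar mY_l^m$.

As a corollary we now prove the second equation in (5.4.5):

Theorem 5.3. $M_z\psi_{nlm} = \hbar m\,\psi_{nlm}$ on $\mathbb{R}^3 - \{0\}$.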



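Proof. In the above notation,

$$\frac{M_z\phi}{C} = (M_zH_1)H_2 + H_1M_zH_2. \tag{5.4.50}$$

We claim $M_zH_1 = 0$: By the above calculations (or by direct recalculation) on
$\mathbb{R}^3 - \{0\}$,

$$\frac{\partial H_1}{\partial x} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{x}{r}, \qquad \frac{\partial H_1}{\partial y} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\frac{y}{r},$$

so that clearly

$$x\frac{\partial H_1}{\partial y} - y\frac{\partial H_1}{\partial x} = -\frac{\alpha}{2}\exp\left(-\frac{\alpha}{2}r\right)\left[\frac{xy}{r} - \frac{yx}{r}\right] = 0 \implies M_zH_1 = 0. \tag{5.4.52}$$

Hence by (5.4.50)

$$\frac{M_z\phi}{C} = H_1M_zH_2 = H_1M_z(H_3H_4) = H_1\left[(M_zH_3)H_4 + H_3M_zH_4\right] \tag{5.4.53}$$

where $M_zH_4 = \hbar mH_4$ by Proposition 5.4. We claim that $M_zH_3 = 0$, which thus
would give

$$\frac{M_z\phi}{C} = H_1H_3\,\hbar mH_4 = \hbar mH_1H_2, \quad\text{or}\quad M_z\phi = \hbar mCH_1H_2 = \hbar m\phi, \tag{5.4.54}$$

as desired.

By the above calculations (see (5.4.28), (5.4.29))

$$\frac{\partial H_3}{\partial x} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha\frac{\partial r}{\partial x}, \qquad \frac{\partial H_3}{\partial y} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha\frac{\partial r}{\partial y}, \tag{5.4.55}$$

where

$$\frac{\partial r}{\partial x} = \frac{x}{r}, \qquad \frac{\partial r}{\partial y} = \frac{y}{r} \tag{5.4.56}$$

on $\mathbb{R}^3 - \{0\}$. Hence

$$x\frac{\partial H_3}{\partial y} - y\frac{\partial H_3}{\partial x} = \left(L^{2l+1}_{n+l}\right)'(\alpha r)\,\alpha\left[\frac{xy}{r} - \frac{yx}{r}\right] = 0, \tag{5.4.57}$$

as claimed. □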



The proof of Theorem 5.2 was a bit long and non-conceptual. It is possible 
to give a group-theoretic proof based on some results of Chapter 7 and ideas in 
Appendix 9C. 



5.5 The Average Distance of the Electron 
from the Nucleus 

Consider for example the normalized wave function $\psi = \psi_{100}$ with total quantum
number 1 and azimuthal and magnetic quantum numbers equal to 0. $\psi$ governs the
properties of the hydrogenic atom in its normal or ground state, i.e., the state of
lowest energy; compare remarks in Section 5.8. By Max Born's basic postulate,
$|\psi|^2$ is the probability distribution for the electron about the nucleus:

$$\iiint_{\mathfrak{R}}|\psi|^2\,dx\,dy\,dz = \text{the probability of finding the electron in a region of space } \mathfrak{R}. \tag{5.5.1}$$



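For example we compute the probability $P$ of finding the electron within a
sphere about the origin of radius $a_0$ = the Bohr radius. By (5.5.1), (5.3.6), (5.3.8),
and (5.3.9)

$$P = \int_0^{2\pi}\!\!\int_0^{\pi}\!\!\int_0^{a_0}R^+_{10}(r)^2\,|\Phi_0(\theta)|^2\,X_{00}(\phi)^2\,r^2\sin\phi\;dr\,d\phi\,d\theta = 4\left(\frac{Z}{a_0}\right)^3\int_0^{a_0}r^2\exp\left(-\frac{2Zr}{a_0}\right)dr. \tag{5.5.2}$$

Now

$$\int r^2e^{-ar}\,dr = -e^{-ar}\left(\frac{r^2}{a} + \frac{2r}{a^2} + \frac{2}{a^3}\right) \tag{5.5.3}$$

for $a \neq 0$ by two integrations by parts. Choose $a = 2Z/a_0$ to get

$$P = 4\left(\frac{Z}{a_0}\right)^3\left[-e^{-2Z}\left(\frac{a_0^3}{2Z} + \frac{2a_0^3}{4Z^2} + \frac{2a_0^3}{8Z^3}\right) + \frac{2a_0^3}{8Z^3}\right], \tag{5.5.4}$$

i.e.,

$$P = -2Z^2e^{-2Z} - 2Ze^{-2Z} - e^{-2Z} + 1. \tag{5.5.5}$$

For the hydrogen atom, for example, $Z = 1 \implies P = -5e^{-2} + 1 = -5\,(0.1353)
+ 1 = 0.3235$.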
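The closed form (5.5.5) can be compared with a direct numerical evaluation of the integral in (5.5.2). The following sketch is illustrative only: it substitutes $s = r/a_0$ so that $a_0$ drops out, uses composite Simpson's rule, and the function names are hypothetical. With more digits of $e^{-2}$ the hydrogen value is about 0.3233; the figure 0.3235 above comes from rounding $e^{-2}$ to 0.1353.

```python
from math import exp


def prob_closed_form(Z):
    """P = 1 - e^(-2Z) (2Z^2 + 2Z + 1), as in (5.5.5)."""
    return 1.0 - exp(-2.0 * Z) * (2.0 * Z**2 + 2.0 * Z + 1.0)


def prob_simpson(Z, steps=2000):
    """Composite Simpson's rule for 4 Z^3 * int_0^1 s^2 exp(-2 Z s) ds, s = r / a_0."""
    if steps % 2:
        raise ValueError("steps must be even")
    h = 1.0 / steps

    def f(s):
        return 4.0 * Z**3 * s * s * exp(-2.0 * Z * s)

    total = f(0.0) + f(1.0)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * f(i * h)
    return total * h / 3.0


if __name__ == "__main__":
    for Z in (1, 2, 3):
        print(Z, prob_closed_form(Z), prob_simpson(Z))
```

As $Z$ grows the electron is pulled closer to the nucleus, so the probability of lying within one Bohr radius increases toward 1.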
The expectation value of the position of the electron, i.e., the average distance
$\bar{r}$ of the electron from the nucleus is given by

$$\bar{r} = \iiint_{\mathbb{R}^3}\sqrt{x^2 + y^2 + z^2}\;|\psi|^2\,dx\,dy\,dz. \tag{5.5.6}$$

For the ground state we obtain, as in (5.5.2)

$$\bar{r} = 4\left(\frac{Z}{a_0}\right)^3\int_0^\infty r^3\exp\left(-\frac{2Zr}{a_0}\right)dr. \tag{5.5.7}$$

Now

$$\int_0^\infty r^ne^{-ar}\,dr = \frac{n!}{a^{n+1}} \tag{5.5.8}$$

for $a > 0$, $n = 0, 1, 2, \ldots$ Thus choose $a = 2Z/a_0$ to obtain $\bar{r} = \frac{3a_0}{2Z}$; thus $\bar{r} = \frac{3}{2}a_0$
for hydrogen. The electron does not move in a circular orbit of radius $a_0$ about the
nucleus of hydrogen but maintains an average distance $\frac{3}{2}a_0 > a_0$ from the nucleus,
as pointed out in Chapter 0. We have shown, moreover, that (for hydrogen) the
probability of finding the electron within a sphere about the origin of radius $a_0$ is
0.3235, in the ground state.

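We can compute the average distance $\bar r_{nlm}$ of the electron from the nucleus for
an arbitrary state $\psi_{nlm}$ of the hydrogenic atom. As in (5.5.6)

$$\bar r_{nlm} = \iiint_{\mathbb{R}^3} r\,|\psi_{nlm}(x,y,z)|^2 \, dx\,dy\,dz. \tag{5.5.9}$$

By (5.3.4), (5.3.8), and (5.3.9), as $\rho_n(r) = \frac{2Zr}{na_0}$,

$$\bar r_{nlm} = \int_0^{\infty} r^3 R_{nl}(r)^2\,dr,$$

and this integral is evaluated by formula (4.7.23) of Corollary 4.5:

Theorem 5.4.

$$\bar r_{nlm} = \frac{a_0 n^2}{Z}\left[1 + \frac{1}{2}\left(1 - \frac{l(l+1)}{n^2}\right)\right]$$

is the average distance of the electron from the nucleus of a hydrogenic atom with
quantum state $\psi_{nlm}$.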
In particular for the ground state $\psi_{100}$ we obtain $\bar r_{100} = \frac{a_0}{Z}\left[1 + \frac{1}{2}\right] = \frac{3a_0}{2Z}$, as
before.
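The formula of Theorem 5.4 is easy to tabulate. The sketch below is illustrative only: it works in units of $a_0$, uses exact rational arithmetic, and the function name `mean_radius` is a hypothetical choice rather than anything from the text.

```python
from fractions import Fraction


def mean_radius(n: int, l: int, Z: int = 1) -> Fraction:
    """Average distance from Theorem 5.4, in units of the Bohr radius a_0.

    Implements (n^2 / Z) * [1 + (1/2) * (1 - l(l+1)/n^2)].
    """
    if not (n >= 1 and 0 <= l <= n - 1):
        raise ValueError("need n >= 1 and 0 <= l <= n - 1")
    bracket = 1 + Fraction(1, 2) * (1 - Fraction(l * (l + 1), n * n))
    return Fraction(n * n, Z) * bracket


# Ground state and a few excited states of hydrogen (Z = 1).
for n, l in [(1, 0), (2, 0), (2, 1), (3, 2)]:
    print(n, l, mean_radius(n, l))
```

For fixed $n$ the average distance decreases as $l$ grows, and for the ground state the function reproduces the value $\frac{3}{2}a_0/Z$ found above.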



5.6 The Ritz-Rydberg Formula: Further Examples 

In the discussion in Section 1.2 of Chapter 1 of the Bohr frequency rule and 
emission spectra we were led to Balmer’s formula and, more generally, the Ritz- 
Rydberg formula. We saw that the proof of those formulas (given the Bohr fre- 
quency rule) depended on the energy levels formula (1.2.4) stated there with- 
out proof. Now that we have established this energy levels formula (in a slightly 
more general form in equation (5.1.29)) the discussion of Chapter 1 attains a fur- 
ther sense of completeness. By equation (5.1.29) we obtain in fact the following 
slightly more general Ritz-Rydberg formula for a hydrogenic atom: 




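$$\frac{1}{\lambda} = Z^2(109678)\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right)\ \mathrm{cm}^{-1}, \qquad n_2 > n_1, \tag{5.6.1}$$

for the transition $E^{(n_2)} \to E^{(n_1)}$
from a higher to a lower energy state, in which case radiation of wavelength $\lambda$ is
emitted. If $n_2 < n_1$ then, as we have seen, radiation is absorbed in the transition
$E^{(n_2)} \to E^{(n_1)}$ (from a lower to a higher energy state), the wavelength $\lambda$ being
given by

$$\frac{1}{\lambda} = Z^2(109678)\left(\frac{1}{n_2^2} - \frac{1}{n_1^2}\right)\ \mathrm{cm}^{-1}. \tag{5.6.2}$$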

As we consider further computational examples of the Ritz-Rydberg formula the 
reader may wish to review some of the content and notation of Chapter 1 . 

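For the Lyman series of hydrogen (where $Z = 1$) choose $n_1 = 1$. For the first
Lyman line choose $n_2 = 2$ to obtain the wavelength $\lambda$ of the emitted radiation,
by (5.6.1):

$$\frac{1}{\lambda} = 109678\left(\frac{1}{1^2} - \frac{1}{2^2}\right)\ \mathrm{cm}^{-1} = 82258.5\ \mathrm{cm}^{-1} \implies \lambda = 1216\ \text{Å}. \tag{5.6.3}$$

Note that as $n_2$ increases, $\frac{1}{\lambda}$ increases; i.e., $\lambda$ decreases. The value 1216 Å in (5.6.3)
is therefore the longest wavelength in the Lyman series; the shortest wavelength
$\lambda_\infty$ is obtained by letting $n_2 \to \infty$: $\frac{1}{\lambda_\infty} = 109678\ \mathrm{cm}^{-1} \implies \lambda_\infty = 911.7$ Å. The
frequency $\nu$ of the first Lyman line of hydrogen with wavelength $\lambda = 1216$ Å is
given by

$$\nu = \frac{c}{\lambda} = \frac{3 \times 10^{10}\ \mathrm{cm/sec}}{1216\ \text{Å}} = 2.467 \times 10^{15}\ \mathrm{Hz}. \tag{5.6.4}$$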



During the transition $E^{(3)} \to E^{(1)}$ for hydrogen a photon is emitted. We calculate
the energy and the momentum of this photon. By equation (1.1.1) the energy
$E$ is given by $E = h\nu$; here $\nu = \frac{c}{\lambda}$ is given by (5.6.1):

$$\nu = c\,109678\left(\frac{1}{1^2} - \frac{1}{3^2}\right)\ \mathrm{cm}^{-1} = 3 \times 10^{10} \times 109678 \times \frac{8}{9}\ \mathrm{sec}^{-1} = 292474.67 \times 10^{10}\ \mathrm{sec}^{-1} \tag{5.6.5}$$

$$\implies E = 6.62608 \times 10^{-27}\ \text{erg-sec} \times 292474.67 \times 10^{10}\ \mathrm{sec}^{-1} = 1937960.5 \times 10^{-17}\ \text{ergs} \approx 12.1\ \text{eV};$$

see Appendix 8A.

The momentum $p$ of the photon is given by de Broglie's wavelength formula
$\lambda = hp^{-1}$. That is, as $E = h\nu = hc\lambda^{-1} = cp$, $p = Ec^{-1} = 1937960.5 \times 10^{-17}$ ergs
$\times\,(3 \times 10^{10}\ \mathrm{cm\ sec^{-1}})^{-1} = 645986.83 \times 10^{-27}\ \mathrm{g\ cm\ sec^{-1}}$.
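The arithmetic for the photon of the transition $E^{(3)} \to E^{(1)}$ can be replayed in a few lines. This is only a sketch of the computation above, using the rounded constants quoted in the text ($c = 3 \times 10^{10}$ cm/sec, $h = 6.62608 \times 10^{-27}$ erg-sec, 1 erg $= 6.242 \times 10^{11}$ eV); the variable names are illustrative.

```python
# Rounded CGS constants as quoted in the text.
C_CM_PER_SEC = 3.0e10          # speed of light
H_ERG_SEC = 6.62608e-27        # Planck's constant
RYDBERG_PER_CM = 109678.0      # constant appearing in (5.6.1)
EV_PER_ERG = 6.242e11          # conversion factor, cf. Section 5.8

# Frequency of the photon emitted in the transition n_2 = 3 -> n_1 = 1 (Z = 1).
nu = C_CM_PER_SEC * RYDBERG_PER_CM * (1.0 / 1**2 - 1.0 / 3**2)   # sec^-1

energy_erg = H_ERG_SEC * nu                 # E = h * nu
energy_ev = energy_erg * EV_PER_ERG         # same energy in electron volts
momentum = energy_erg / C_CM_PER_SEC        # p = E / c, in g cm / sec

print(nu, energy_erg, energy_ev, momentum)
```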

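In the next example we compute the wavelength $\lambda$ of the second line in the
Paschen series for hydrogen. Thus in (5.6.1) take $n_1 = 3$, $n_2 = 5$ to obtain

$$\frac{1}{\lambda} = 109678\left(\frac{1}{9} - \frac{1}{25}\right)\ \mathrm{cm}^{-1} = 7799.3244\ \mathrm{cm}^{-1}; \text{ i.e., } \lambda = .0001282\ \mathrm{cm} = 12{,}820\ \text{Å}. \tag{5.6.6}$$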
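The emission formula (5.6.1) is simple enough to wrap in a small helper. The following sketch is an illustration, with a hypothetical function name and the constant 109678 taken from the text; it reproduces the first Lyman line and the second Paschen line up to rounding.

```python
RYDBERG_PER_CM = 109678.0  # constant used in (5.6.1), in cm^-1


def emission_wavelength_angstrom(Z: int, n1: int, n2: int) -> float:
    """Wavelength (in angstroms) emitted in the transition n2 -> n1, per (5.6.1)."""
    if not (n2 > n1 >= 1):
        raise ValueError("emission requires n2 > n1 >= 1")
    inverse_wavelength = Z * Z * RYDBERG_PER_CM * (1.0 / n1**2 - 1.0 / n2**2)  # cm^-1
    return 1.0e8 / inverse_wavelength  # 1 cm = 1e8 angstroms


lyman_first = emission_wavelength_angstrom(1, 1, 2)     # first Lyman line of hydrogen
paschen_second = emission_wavelength_angstrom(1, 3, 5)  # second Paschen line of hydrogen
print(lyman_first, paschen_second)
```

Calling the same helper with other values of `Z` gives the corresponding lines of hydrogenic ions, since $\lambda$ scales like $1/Z^2$.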



A $\mu$-meson (or muon) has the same charge $-e$ as an electron but its mass is 207
times the electron mass $m_e$. Consider a $\mu$-mesonic lead atom with $Z = 82$ protons.
To obtain the $n$th energy level $E^{(n)}$ of this atom replace $m_e$ by $207m_e$ in (5.1.29).
In fact replace the relative mass $m_em_p/(m_e + m_p) \approx .99\,m_e$ (by (0.0.10)) by $207m_e$ to
obtain

$$E^{(n)} = (207)(82)^2\,E_H^{(n)} = 1391868\,E_H^{(n)}, \tag{5.6.7}$$

where $E_H^{(n)}$ is the $n$th energy level of hydrogen. The transition $E^{(2)} \to E^{(1)}$ (again
from a higher to a lower energy level, where radiation is emitted) gives rise to the
first Lyman line (of $\mu$-mesonic lead) with frequency

$$\begin{aligned}
\nu = \frac{E^{(2)} - E^{(1)}}{h} &= 1391868\,\frac{E_H^{(2)} - E_H^{(1)}}{h} \\
&= 1391868 \times (\text{the frequency of the first Lyman line for hydrogen}) \\
&= 1391868 \times 2.467 \times 10^{15}\ \mathrm{Hz} \quad (\text{by (5.6.4)}) \\
&= 3433738.4 \times 10^{15}\ \mathrm{Hz}.
\end{aligned} \tag{5.6.8}$$



In 1930, Ericson and Edlén (in Zeits. f. Phys. 59, 679) reported that the first Lyman
line of doubly ionized lithium Li$^{++}$ (i.e., lithium with two electrons removed,
with $Z = 3$, as we have seen) and the first Lyman line of triply ionized beryllium
Be$^{+++}$ (beryllium with three electrons removed, with $Z = 4$) have wavelengths
$\lambda_e = 135.02$ Å, $75.94$ Å, respectively, in the high ultraviolet range; cf. Table 1.1.
We can compare these experimental values with theoretical values $\lambda_t$ predicted by
quantum mechanics. Namely, for $Z = 3$, we obtain by (5.6.1), (5.6.3)

$$\frac{1}{\lambda_t} = Z^2(82258.5)\ \mathrm{cm}^{-1} \implies \lambda_t = \frac{1}{9}\,1216\ \text{Å} = 135.11\ \text{Å}, \tag{5.6.9}$$

which is in good agreement with the value $\lambda_e = 135.02$ Å for Li$^{++}$. Similarly for
$Z = 4$, equations (5.6.1), (5.6.3) provide the value

$$\lambda_t = \frac{1}{16}\,1216\ \text{Å} = 76\ \text{Å} \tag{5.6.10}$$

for Be$^{+++}$, which is in good agreement with the experimental value $\lambda_e = 75.94$ Å.

As a final example, the following question is posed: Does there exist an $n > 1$
such that a hydrogenic atom can absorb radiation in the x-ray range (say $\lambda = 2$ Å)
via the transition $E^{(1)} \to E^{(n)}$? Recall Table 1.1. The answer is easy to come by.
Namely, equation (5.6.2) requires that the wavelength $\lambda$ of the absorbed radiation
be given by

$$\frac{1}{\lambda} = Z^2(109678)\left(1 - \frac{1}{n^2}\right)\ \mathrm{cm}^{-1}. \tag{5.6.11}$$

For $\lambda = 2 \times 10^{-8}$ cm this means that

$$\frac{1}{n^2} = 1 - \frac{1}{2 \times 10^{-8}\,Z^2(109678)} \approx 1 - \frac{460}{Z^2} = \frac{Z^2 - 460}{Z^2}. \tag{5.6.12}$$

Since $\frac{1}{n^2} > 0$ we must have $Z^2 > 460$, or $Z \geq 22$. The answer is therefore "no"
if $1 \leq Z \leq 21$. On the other hand, for $Z^2 > 460$, the requirement $n \geq 2$ gives

$$n^2 = \frac{Z^2}{Z^2 - 460} \geq 4 \implies Z^2 \geq 4(Z^2 - 460) \implies 3Z^2 \leq 1840 \implies Z \leq 24.77 \implies Z = 22, 23, \text{ or } 24. \tag{5.6.13}$$

However, the three values $Z = 22, 23, 24$ force $n^2 = 484/24,\ 529/69,\ 576/116$,
respectively, which is not possible for $n$ integral. Thus in general the answer to
the posed question is "no."
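The case analysis above can also be confirmed by brute force. The sketch below is an illustration under the same rounding as (5.6.12), namely the constant 460; the range of atomic numbers searched and the helper names are assumptions made here.

```python
import math
from fractions import Fraction

K = 460  # rounded value of 1 / (2e-8 * 109678) used in (5.6.12)


def candidate_n_squared(Z: int):
    """Return n^2 = Z^2 / (Z^2 - K) from (5.6.12), or None when Z^2 <= K."""
    if Z * Z <= K:
        return None
    return Fraction(Z * Z, Z * Z - K)


def is_square_of_integer_at_least_two(q: Fraction) -> bool:
    """True when q = n^2 for some integer n >= 2."""
    if q.denominator != 1:
        return False
    root = math.isqrt(q.numerator)
    return root >= 2 and root * root == q.numerator


solutions = []
for Z in range(1, 119):  # assumed search range of atomic numbers
    q = candidate_n_squared(Z)
    if q is not None and is_square_of_integer_at_least_two(q):
        solutions.append(Z)

print(solutions)  # an empty list means no admissible pair (Z, n) was found
```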



5.7 Wave Function Orthogonality 

Equation (4.8.6) expresses orthogonality of the normalized wave functions
$\{\psi_n\}$ of the quantum harmonic oscillator. Similarly, one can establish
orthogonality for normalized hydrogenic wave functions. From equation (5.3.9),
$\psi_{nlm} \circ g = R_{nl} \otimes \Phi_m \otimes X_{lm}$ on $\{(r,\theta,\phi) \in \mathbb{R}^3 \mid r > 0,\ 0 < \phi < \pi\}$. As a consequence
one indeed has
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Proposition 5.5 (Wave Function Orthogonality). 



$$\int_{\mathbb{R}^3} \psi_{nlm}\,\overline{\psi_{nl'm'}} \; dx\,dy\,dz = \delta_{(l,m),(l',m')}. \tag{5.7.1}$$



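Proof. The left-hand side of (5.7.1) is

$$\begin{aligned}
I = \langle \psi_{nlm}, \psi_{nl'm'} \rangle
&= \int_0^{\infty}\!\int_0^{2\pi}\!\int_0^{\pi} \psi_{nlm}(g(r,\theta,\phi))\,\overline{\psi_{nl'm'}(g(r,\theta,\phi))}\; r^2 \sin\phi \; d\phi\,d\theta\,dr \\
&= \int_0^{\infty}\!\int_0^{2\pi}\!\int_0^{\pi} R_{nl}(r)\,\Phi_m(\theta)\,X_{lm}(\phi)\,R_{nl'}(r)\,\overline{\Phi_{m'}(\theta)}\,X_{l'm'}(\phi)\; r^2 \sin\phi \; d\phi\,d\theta\,dr.
\end{aligned} \tag{5.7.2}$$

Now since

$$\int_0^{2\pi} \Phi_m(\theta)\,\overline{\Phi_{m'}(\theta)}\,d\theta = \frac{1}{2\pi}\int_0^{2\pi} e^{i(m-m')\theta}\,d\theta = \delta_{mm'}, \tag{5.7.3}$$

$I$ vanishes unless $m = m'$, in which case

$$I = \int_0^{\infty} R_{nl}(r)\,R_{nl'}(r)\; r^2\,dr \int_0^{\pi} X_{lm}(\phi)\,X_{l'm}(\phi) \sin\phi \; d\phi, \tag{5.7.4}$$

where, writing $c_{lm}$ for the normalizing constant of $X_{lm}$ in definition (5.3.8),

$$\begin{aligned}
\int_0^{\pi} X_{lm}(\phi)\,X_{l'm}(\phi)\sin\phi\;d\phi
&= c_{lm}c_{l'm}\int_0^{\pi} P_l^{|m|}(\cos\phi)\,P_{l'}^{|m|}(\cos\phi)\sin\phi\;d\phi \\
&= c_{lm}c_{l'm}\,\frac{2}{2l'+1}\,\frac{(l'+|m|)!}{(l'-|m|)!}\,\delta_{ll'} = \delta_{ll'}
\end{aligned} \tag{5.7.5}$$

by Theorem 4.8 of Chapter 4, and by definition (5.2.11). That is, $I$ vanishes unless
$m = m'$ and $l = l'$, in which case

$$I = \int_0^{\infty} R_{nl}(r)^2\, r^2\,dr = 1 \quad \text{by (5.2.6).} \tag{5.7.6}$$

□

Since orthogonality implies linear independence we have

Corollary 5.1. The normalized wave functions $\{\psi_{nlm}\}$, $0 \leq l \leq n-1$, $-l \leq m \leq l$, are linearly
independent over $\mathbb{C}$.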
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5.8 Ground State Energy of Hydrogen-like Atoms 

The normal state of a hydrogen atom is determined by the wave function $\psi_{100}$,
where $n = 1$:

$$\psi_{100}(r,\theta,\phi) = \left(\frac{1}{a_0}\right)^{3/2} \frac{1}{\sqrt{\pi}}\; e^{-r/a_0} \tag{5.8.1}$$

by (5.3.12). Its lowest, or ground state, energy $-E_H$ is given by (5.1.29):

$$\begin{aligned}
-E_H \stackrel{\text{def}}{=} E^{(1)} &= -\frac{4845.9354 \times 10^{-68}\ \mathrm{dyne^2\,cm^4\,gr}}{2.2242358 \times 10^{-54}\ \mathrm{erg^2\,sec^2}} \\
&= -2.1786968 \times 10^{-11}\ \text{ergs} \\
&= -13.59\ \text{eV}
\end{aligned} \tag{5.8.2}$$

by (0.0.2), (0.0.3), (0.0.10), where 1 erg $= 6.242 \times 10^{11}$ electron volts (eV). See
Appendix 8A.

By (5.1.29) again, the ground state energy of a hydrogenic atom with atomic
number $Z$ is given by

$$E = -Z^2E_H. \tag{5.8.3}$$



Appendix 5A Commutation Relations for Angular 
Momentum Operators 

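The angular momentum operators $M_x$, $M_y$, $M_z$ of definition (5.4.4) are subject
to the commutation relations

$$[M_x, M_y] = i\hbar M_z, \qquad [M_y, M_z] = i\hbar M_x, \qquad [M_z, M_x] = i\hbar M_y, \tag{5.A.1}$$

which are easy to check. For example, by the product rule,

$$M_xM_y = -\hbar^2\left(y\frac{\partial}{\partial z}\,z\frac{\partial}{\partial x} - y\frac{\partial}{\partial z}\,x\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\,z\frac{\partial}{\partial x} + z\frac{\partial}{\partial y}\,x\frac{\partial}{\partial z}\right) \tag{5.A.2}$$

$$= -\hbar^2\left(y\frac{\partial}{\partial x} + yz\frac{\partial^2}{\partial z\,\partial x} - yx\frac{\partial^2}{\partial z^2} - z^2\frac{\partial^2}{\partial y\,\partial x} + zx\frac{\partial^2}{\partial y\,\partial z}\right). \tag{5.A.3}$$

Similarly

$$M_yM_x = -\hbar^2\left(z\frac{\partial}{\partial x}\,y\frac{\partial}{\partial z} - x\frac{\partial}{\partial z}\,y\frac{\partial}{\partial z} - z\frac{\partial}{\partial x}\,z\frac{\partial}{\partial y} + x\frac{\partial}{\partial z}\,z\frac{\partial}{\partial y}\right) \tag{5.A.4}$$

$$= -\hbar^2\left(zy\frac{\partial^2}{\partial x\,\partial z} - xy\frac{\partial^2}{\partial z^2} - z^2\frac{\partial^2}{\partial x\,\partial y} + x\frac{\partial}{\partial y} + xz\frac{\partial^2}{\partial z\,\partial y}\right).$$

Using the equality of mixed partial derivatives one obtains from (5.A.3) and
(5.A.4)

$$[M_x, M_y] \stackrel{\text{def}}{=} M_xM_y - M_yM_x = -\hbar^2\left(y\frac{\partial}{\partial x} - x\frac{\partial}{\partial y}\right) = i\hbar M_z. \tag{5.A.5}$$

The other two commutation relations in (5.A.1) are similarly verified.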

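In addition to the relations in (5.A.1) one has the commutation relations

$$[M^2, M_x] = [M^2, M_y] = [M^2, M_z] = 0. \tag{5.A.6}$$

For example

$$\begin{aligned}
[M_x^2, M_z] &\stackrel{\text{def}}{=} M_xM_xM_z - M_zM_xM_x \\
&= M_x(M_zM_x - i\hbar M_y) - (M_xM_z + i\hbar M_y)M_x \\
&= -i\hbar(M_xM_y + M_yM_x) \quad (\text{by (5.A.1)}).
\end{aligned} \tag{5.A.7}$$

Similarly by (5.A.1)

$$[M_y^2, M_z] = M_yM_yM_z - M_zM_yM_y = i\hbar(M_yM_x + M_xM_y). \tag{5.A.8}$$

As $[M_z^2, M_z] = 0$ we see that

$$[M^2, M_z] = [M_x^2 + M_y^2 + M_z^2, M_z] = 0$$

by (5.A.7), (5.A.8). Similarly $[M^2, M_y] = 0$ and $[M^2, M_x] = 0$.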
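The relations (5.A.1) and (5.A.6) can be spot-checked by letting the operators act on a polynomial. The sketch below is an illustration under stated assumptions: it takes $\hbar = 1$, it assumes the standard form $M_x = \frac{\hbar}{i}\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right)$ (and cyclically) for definition (5.4.4), and it represents a polynomial in $x, y, z$ as a dictionary from exponent triples to coefficients. All helper names are hypothetical.

```python
from collections import defaultdict

# A polynomial in x, y, z is a dict {(i, j, k): coefficient} meaning sum of c * x^i y^j z^k.


def clean(p):
    """Drop zero coefficients so that equal polynomials compare equal."""
    return {e: c for e, c in p.items() if c != 0}


def add(p, q, s=1):
    """Return p + s*q."""
    r = defaultdict(complex)
    for e, c in p.items():
        r[e] += c
    for e, c in q.items():
        r[e] += s * c
    return clean(r)


def scale(p, s):
    return clean({e: s * c for e, c in p.items()})


def mul_var(p, axis):
    """Multiply by x (axis 0), y (axis 1) or z (axis 2)."""
    r = {}
    for e, c in p.items():
        f = list(e)
        f[axis] += 1
        r[tuple(f)] = c
    return r


def diff(p, axis):
    """Partial derivative with respect to x, y or z."""
    r = {}
    for e, c in p.items():
        if e[axis] > 0:
            f = list(e)
            f[axis] -= 1
            r[tuple(f)] = c * e[axis]
    return r


def M(p, u, v):
    """Apply (1/i)(u d/dv - v d/du) to p, i.e. hbar = 1 (assumed form of (5.4.4))."""
    return scale(add(mul_var(diff(p, v), u), mul_var(diff(p, u), v), -1), -1j)


def Mx(p):
    return M(p, 1, 2)   # (1/i)(y d/dz - z d/dy)


def My(p):
    return M(p, 2, 0)   # (1/i)(z d/dx - x d/dz)


def Mz(p):
    return M(p, 0, 1)   # (1/i)(x d/dy - y d/dx)


def M2(p):
    """M^2 = Mx^2 + My^2 + Mz^2 applied to p."""
    return add(add(Mx(Mx(p)), My(My(p))), Mz(Mz(p)))


def commutator(A, B, p):
    """[A, B] applied to p."""
    return add(A(B(p)), B(A(p)), -1)


# Test polynomial x^2 y + 3 y z^3 + 2 x y z (an arbitrary choice).
p = {(2, 1, 0): 1, (0, 1, 3): 3, (1, 1, 1): 2}

print(commutator(Mx, My, p) == scale(Mz(p), 1j))   # [Mx, My] p versus i Mz p
print(commutator(M2, Mz, p) == {})                 # [M^2, Mz] p versus 0
```

A check on one polynomial is of course not a proof; it only guards against sign slips in the hand computation.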

The commutation relations of (5.A.1) will serve as the starting point in Chapter 9
for the construction of certain spin operators, a construction that will be
based on the representation theory of the complex Lie algebra $\mathfrak{sl}(2, \mathbb{C})$.




6 

Heisenberg’s Uncertainty Principle 



6.1 Heisenberg’s Inequality 

We establish in this section an integral inequality

$$\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\psi'(x)|^2\,dx - 2ia\int_{\mathbb{R}} \psi'(x)\overline{\psi(x)}\,dx + a^2\int_{\mathbb{R}} |\psi(x)|^2\,dx\right] \geq \frac{1}{4}\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2, \tag{6.1.1}$$

referred to as Heisenberg's inequality, for a sufficiently nice function $\psi : \mathbb{R} \to \mathbb{C}$
where $a, b \in \mathbb{R}$. Even though $\psi$ is allowed to be complex-valued the second factor
on the left-hand side of (6.1.1) will be non-negative. This inequality, which is
a purely mathematical result of course, will be shown to form the basis for a
precise formulation of Heisenberg's uncertainty principle. The discussion of this
principle in Chapter 1 was brief and non-rigorous. For now, we proceed strictly
along mathematical lines.

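Before getting into a proof of (6.1.1) consider first an example. Fix $a > 0$ and
define $\psi : \mathbb{R} \to \mathbb{R}$ by

$$\psi(x) = \sin\frac{\pi}{a}x \ \text{ on } [0,a], \qquad \psi(x) = 0 \ \text{ off } [0,a]. \tag{6.1.2}$$

For $b \in \mathbb{R}$ consider

$$\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx = \int_0^a (x^2 - 2xb + b^2)\sin^2\frac{\pi}{a}x\;dx. \tag{6.1.3}$$

Now for $\beta \in \mathbb{R} - \{0\}$

$$\int x^2\sin^2\beta x\;dx = \frac{x^3}{6} - \frac{x\cos 2\beta x}{4\beta^2} - \frac{1}{4\beta^3}\left(\beta^2x^2 - \frac{1}{2}\right)\sin 2\beta x + C. \tag{6.1.4}$$

Choose $\beta = \pi/a$ to obtain

$$\int_0^a x^2\sin^2\frac{\pi}{a}x\;dx = \frac{a^3}{2}\left[\frac{1}{3} - \frac{1}{2\pi^2}\right]. \tag{6.1.5}$$

Also

$$\int x\sin^2\beta x\;dx = \frac{x^2}{4} - \frac{x\sin 2\beta x}{4\beta} - \frac{\cos 2\beta x}{8\beta^2} + C, \tag{6.1.6}$$

$$\int \sin^2\beta x\;dx = \frac{x}{2} - \frac{\sin 2\beta x}{4\beta} + C, \tag{6.1.7}$$

so that

$$\int_0^a x\sin^2\frac{\pi}{a}x\;dx = \frac{a^2}{4}, \tag{6.1.8}$$

$$\int_0^a \sin^2\frac{\pi}{a}x\;dx = \frac{a}{2}, \tag{6.1.9}$$

which together with (6.1.5) implies that

$$\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx = \frac{a^3}{2}\left[\frac{1}{3} - \frac{1}{2\pi^2}\right] - \frac{ba^2}{2} + \frac{b^2a}{2}. \tag{6.1.10}$$

We shall choose, for example, $b = \frac{a}{2}$, which is the mean value

$$\frac{\int_0^a x|\psi(x)|^2\,dx}{\int_0^a |\psi(x)|^2\,dx} = \frac{a}{2} \quad (\text{by (6.1.8), (6.1.9)}). \tag{6.1.11}$$

Then (by (6.1.10))

$$\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx = \frac{a^3}{2}\left[\frac{1}{3} - \frac{1}{2\pi^2}\right] - \frac{a^3}{4} + \frac{a^3}{8} = \frac{a^3}{4}\left[\frac{1}{6} - \frac{1}{\pi^2}\right]. \tag{6.1.12}$$

Next

$$\int \cos^2\beta x\;dx = \frac{x}{2} + \frac{\sin 2\beta x}{4\beta} + C \quad \text{and} \tag{6.1.13}$$

$$\int \cos\beta x\,\sin\beta x\;dx = \frac{\sin^2\beta x}{2\beta} + C \tag{6.1.14}$$

imply that

$$\int_{\mathbb{R}} |\psi'(x)|^2\,dx = \frac{\pi^2}{a^2}\int_0^a \cos^2\frac{\pi}{a}x\;dx = \frac{\pi^2}{2a}, \tag{6.1.15}$$

$$\int_{\mathbb{R}} \psi'(x)\psi(x)\,dx = \frac{\pi}{a}\int_0^a \cos\frac{\pi}{a}x\,\sin\frac{\pi}{a}x\;dx = 0. \tag{6.1.16}$$

For the choice $b = a/2$ we therefore have by (6.1.9), (6.1.12), (6.1.15), (6.1.16)

$$\begin{aligned}
\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]&\left[\int_{\mathbb{R}} |\psi'(x)|^2\,dx - 2ia\int_{\mathbb{R}} \psi'(x)\psi(x)\,dx + a^2\int_{\mathbb{R}} |\psi(x)|^2\,dx\right] \\
&= \frac{a^3}{4}\left[\frac{1}{6} - \frac{1}{\pi^2}\right]\left[\frac{\pi^2}{2a} + a^2\,\frac{a}{2}\right] \\
&\geq \frac{a^3}{4}\left[\frac{1}{6} - \frac{1}{\pi^2}\right]\frac{\pi^2}{2a} = \frac{a^2}{8}\left[\frac{\pi^2}{6} - 1\right] \\
&\geq \frac{a^2}{16} = \frac{1}{4}\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2,
\end{aligned} \tag{6.1.17}$$

where the last inequality holds since $\pi^2 \geq 9$,
which is a verification of Heisenberg's inequality for the example at hand.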
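The verification for the example $\psi(x) = \sin\frac{\pi}{a}x$ can be repeated numerically. The sketch below is an illustration only: to avoid the double use of the letter $a$ it calls the interval length `L` and the real parameter of (6.1.1) `a`, it uses a plain midpoint rule, and the function names are hypothetical.

```python
import math


def integrate(f, lo, hi, steps=20000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (k + 0.5) * h) for k in range(steps))


def heisenberg_sides(L, a, b):
    """Return (lhs, rhs, spread, cross) of (6.1.1) for psi = sin(pi x / L) on [0, L]."""
    psi = lambda x: math.sin(math.pi * x / L)
    dpsi = lambda x: (math.pi / L) * math.cos(math.pi * x / L)
    norm = integrate(lambda x: psi(x) ** 2, 0.0, L)               # cf. (6.1.9)
    spread = integrate(lambda x: (x - b) ** 2 * psi(x) ** 2, 0.0, L)
    kinetic = integrate(lambda x: dpsi(x) ** 2, 0.0, L)           # cf. (6.1.15)
    cross = integrate(lambda x: dpsi(x) * psi(x), 0.0, L)         # cf. (6.1.16), vanishes
    # For this real psi the term -2ia * cross drops out because cross = 0.
    second_factor = kinetic + a * a * norm
    return spread * second_factor, 0.25 * norm ** 2, spread, cross


L = 2.0
lhs, rhs, spread, cross = heisenberg_sides(L, a=L, b=L / 2)
closed_form_spread = (L ** 3 / 4) * (1.0 / 6 - 1.0 / math.pi ** 2)   # cf. (6.1.12)
print(lhs, rhs, spread, closed_form_spread)
```

The left-hand side stays above the right-hand side for every real `a`, since the term $a^2\int|\psi|^2$ only increases the second factor.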

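Consider now a proof of (6.1.1). Although we could get by for less we shall
assume for convenience (as Weyl does in [87]) that $\psi : \mathbb{R} \to \mathbb{C}$ is continuously
differentiable with $\psi, \psi', x\psi \in L^2(\mathbb{R}, dx)$; $dx$ = Lebesgue measure on $\mathbb{R}$. By
Hölder's inequality $x\psi\bar\psi' \in L^1(\mathbb{R}, dx)$; hence $x\bar\psi\psi' \in L^1(\mathbb{R}, dx)$. Similarly by
Hölder's inequality $x^2\psi\bar\psi \in L^1(\mathbb{R}, dx)$, and we can define three real numbers

$$A = \int_{\mathbb{R}} x^2|\psi(x)|^2\,dx, \qquad B = \int_{\mathbb{R}} \left[x\psi(x)\overline{\psi'(x)} + x\overline{\psi(x)}\psi'(x)\right]dx, \qquad C = \int_{\mathbb{R}} |\psi'(x)|^2\,dx. \tag{6.1.18}$$

For $t, x \in \mathbb{R}$ arbitrary

$$\begin{aligned}
|tx\psi(x) + \psi'(x)|^2 &= [tx\psi(x) + \psi'(x)][tx\overline{\psi(x)} + \overline{\psi'(x)}] \\
&= t^2x^2|\psi(x)|^2 + tx\psi'(x)\overline{\psi(x)} + tx\psi(x)\overline{\psi'(x)} + |\psi'(x)|^2,
\end{aligned} \tag{6.1.19}$$

which implies that

$$\int_{\mathbb{R}} |tx\psi(x) + \psi'(x)|^2\,dx = At^2 + Bt + C. \tag{6.1.20}$$

That is, $At^2 + Bt + C \geq 0\ \ \forall\, t \in \mathbb{R} \implies B^2 - 4AC \leq 0$. Note that the integrand
for $B$ is a continuous element in $L^1(\mathbb{R}, dx)$ by our hypotheses on $\psi$. In particular

$$\begin{aligned}
B &= \lim_{c\to\infty}\int_{-c}^{c} \left[x\psi(x)\overline{\psi'(x)} + x\overline{\psi(x)}\psi'(x)\right]dx \\
&= \lim_{c\to\infty}\int_{-c}^{c} x\frac{d}{dx}\left[\psi(x)\overline{\psi(x)}\right]dx \\
&= \lim_{c\to\infty}\left\{\left[x\psi(x)\overline{\psi(x)}\right]_{-c}^{c} - \int_{-c}^{c} \psi(x)\overline{\psi(x)}\,dx\right\} \quad (\text{integration by parts}) \\
&= -\int_{\mathbb{R}} |\psi(x)|^2\,dx
\end{aligned} \tag{6.1.21}$$

as $x|\psi(x)|^2 \to 0$ when $x \to \pm\infty$, by the properties of $\psi$, a short proof of which
can be found in Appendix 1 of [87]. In fact, in general, for $\psi \in L^2(\mathbb{R}, dx)$,
$\lim_{x\to\infty} x\left[|\psi(x)|^2 + |\psi(-x)|^2\right] = 0$.

The statement $B^2 - 4AC \leq 0$ becomes

$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2 \leq 4\left[\int_{\mathbb{R}} x^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\psi'(x)|^2\,dx\right], \tag{6.1.22}$$

which is (6.1.1) for $a, b = 0$, and which we can generalize a bit as follows. For
$a, b \in \mathbb{R}$ arbitrary let $f(x) = e^{iax}\psi(x+b)$. Then $f'(x) = e^{iax}\psi'(x+b) + iae^{iax}\psi(x+b)$
$\implies f, f' \in L^2(\mathbb{R}, dx)$ with $f'$ continuous. Also $xf \in L^2(\mathbb{R}, dx)$:

$$\int_{\mathbb{R}} x^2|f(x)|^2\,dx = \int_{\mathbb{R}} x^2|\psi(x+b)|^2\,dx = \int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx = \int_{\mathbb{R}} (x^2 - 2xb + b^2)|\psi(x)|^2\,dx < \infty \tag{6.1.23}$$

by Hölder's inequality, as above, as $\psi, x\psi \in L^2(\mathbb{R}, dx)$. That is, $f$ satisfies the
conditions assumed for $\psi$. As

$$\int_{\mathbb{R}} |f(x)|^2\,dx = \int_{\mathbb{R}} |\psi(x+b)|^2\,dx = \int_{\mathbb{R}} |\psi(x)|^2\,dx, \tag{6.1.24}$$

conclude by (6.1.22) applied to $f$

$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2 \leq 4\left[\int_{\mathbb{R}} x^2|\psi(x+b)|^2\,dx\right]\left[\int_{\mathbb{R}} |f'(x)|^2\,dx\right], \tag{6.1.25}$$

where

$$\begin{aligned}
|f'(x)|^2 &= e^{iax}[\psi'(x+b) + ia\psi(x+b)]\,e^{-iax}[\overline{\psi'(x+b)} - ia\overline{\psi(x+b)}] \\
&= |\psi'(x+b)|^2 + ia\psi(x+b)\overline{\psi'(x+b)} - ia\psi'(x+b)\overline{\psi(x+b)} + a^2|\psi(x+b)|^2.
\end{aligned} \tag{6.1.26}$$

Applying the integration by parts argument again we get

$$\begin{aligned}
\int_{\mathbb{R}} \psi(x)\overline{\psi'(x)}\,dx &= \lim_{c\to\infty}\int_{-c}^{c} \psi(x)\overline{\psi'(x)}\,dx \\
&= \lim_{c\to\infty}\left\{\left[\psi(x)\overline{\psi(x)}\right]_{-c}^{c} - \int_{-c}^{c} \psi'(x)\overline{\psi(x)}\,dx\right\} \\
&= -\int_{\mathbb{R}} \psi'(x)\overline{\psi(x)}\,dx,
\end{aligned} \tag{6.1.27}$$

and using again the translation invariance of $dx$ we see by (6.1.26) that

$$\int_{\mathbb{R}} |f'(x)|^2\,dx = \int_{\mathbb{R}} |\psi'(x)|^2\,dx - 2ia\int_{\mathbb{R}} \psi'(x)\overline{\psi(x)}\,dx + a^2\int_{\mathbb{R}} |\psi(x)|^2\,dx. \tag{6.1.28}$$

In particular the right-hand side of (6.1.28) is non-negative. By the inequality
in (6.1.25) we obtain indeed the Heisenberg inequality in (6.1.1). □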

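The non-negative factor

$$\int_{\mathbb{R}} |\psi'(x)|^2\,dx - 2ia\int_{\mathbb{R}} \psi'(x)\overline{\psi(x)}\,dx + a^2\int_{\mathbb{R}} |\psi(x)|^2\,dx = \int_{\mathbb{R}} |f'(x)|^2\,dx \tag{6.1.29}$$

in (6.1.28) (for $f(x) = e^{iax}\psi(x+b)$) can be expressed in an alternate way if we
assume a bit more about $\psi$. Assume, for example, that $\psi'$ satisfies the conditions
imposed on $\psi$: $\psi'$ is continuously differentiable with $\psi', \psi'', x\psi' \in L^2(\mathbb{R}, dx)$.
In particular $\psi''\bar\psi \in L^1(\mathbb{R}, dx)$ by Hölder's inequality, and as above $|\psi(x)|^2$,
$|\psi(-x)|^2$, $|\psi'(x)|^2$, $|\psi'(-x)|^2 \to 0$ as $x \to \infty$. It follows also that $\psi'(x)\overline{\psi(x)}$,
$\psi'(-x)\overline{\psi(-x)} \to 0$ as $x \to \infty$. Integration by parts therefore gives

$$\int_{\mathbb{R}} \psi'(x)\overline{\psi'(x)}\,dx = -\int_{\mathbb{R}} \psi''(x)\overline{\psi(x)}\,dx, \tag{6.1.30}$$

which permits us to write

$$F_\psi(a) \stackrel{\text{def}}{=} \int_{\mathbb{R}} \left[-\psi''(x) - 2ia\psi'(x) + a^2\psi(x)\right]\overline{\psi(x)}\,dx = \int_{\mathbb{R}} \left[\left(i\frac{d}{dx} - a\right)^2\psi\right](x)\,\overline{\psi(x)}\,dx \tag{6.1.31}$$

to obtain the following version of Heisenberg's inequality (6.1.1):

Theorem 6.1. For a sufficiently nice $\psi \in L^2(\mathbb{R}, dx)$ ($\psi$ real or complex-valued)
let

$$F_\psi(a) = \int_{\mathbb{R}} (D_a^2\psi)(x)\,\overline{\psi(x)}\,dx \tag{6.1.32}$$

for $a \in \mathbb{R}$, where $D_a = i\frac{d}{dx} - aI$; $(D_a^2\psi)(x) = -\psi''(x) - 2ia\psi'(x) + a^2\psi(x)$,
$x \in \mathbb{R}$. Then $F_\psi(a) \geq 0$ and for $b \in \mathbb{R}$

$$F_\psi(a)\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx \geq \frac{1}{4}\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2. \tag{6.1.33}$$

Equation (6.1.33) holds if, for example, $\psi, \psi', \psi'', x\psi, x\psi' \in L^2(\mathbb{R}, dx)$ with
$\psi''$ continuous.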



6.2 Heisenberg’s Uncertainty Principle 

As indicated in Section 1.5 of Chapter 1 the uncertainties $\Delta x$, $\Delta p$ in the position
and momentum $x$, $p$ of an electron are subject to Heisenberg's uncertainty
principle

$$\Delta x\,\Delta p \geq \frac{\hbar}{2}. \tag{6.2.1}$$

Thus as the uncertainty in $x$ decreases, the uncertainty in $p$ increases, and similarly
$\Delta x$ increases as $\Delta p$ decreases. However the argument which led to (6.2.1) was
heuristic only. We did not formally define $\Delta x$, $\Delta p$ in fact. The purpose of this
section is to provide a meaning for $\Delta x$, $\Delta p$ and thus to establish (6.2.1) on more
precise grounds. We shall provide two formulations of (6.2.1), and two proofs.
To this end suppose in general that $\phi$ is some distribution function on $\mathbb{R}$:

$$\phi \in L^1(\mathbb{R}, dx) \text{ such that } \int_{\mathbb{R}} \phi(x)\,dx = 1, \quad \phi \geq 0. \tag{6.2.2}$$

If $f$ is a Lebesgue measurable function on $\mathbb{R}$ such that $f\phi \in L^1(\mathbb{R}, dx)$, then we
can define the mean or expected value $\bar f$ of $f$ with respect to $\phi$ by

$$\bar f = \int_{\mathbb{R}} f(x)\phi(x)\,dx. \tag{6.2.3}$$

Of course $\bar f$ also denotes the complex conjugate of $f$, which could cause confusion
at times. In particular for $\phi = |\psi|^2$ where $\psi$ is some normalized wave
function, and for $f(x) = x$ we have, as considered before, the mean value

$$\bar x = \int_{\mathbb{R}} x|\psi(x)|^2\,dx \in \mathbb{R}. \tag{6.2.4}$$

For example, compare Section 5.5 of Chapter 5.
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There are quantities in classical physics like momentum p that can (at times) be 
expressed in terms of the coordinate x, or the coordinates x, y, z. This is not possi- 
ble in quantum mechanics however as one would have a violation of Heisenberg’s 
uncertainty principle. The equation analogous to (6.2.4) 



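$$\bar p = \int_{\mathbb{R}} p(x)|\psi(x)|^2\,dx \tag{6.2.5}$$

thus has no meaning at the quantum level. The following question arises therefore:
how should the mean value $\bar p$ of $p$ (with respect to $|\psi|^2$) be defined? To
answer this question we first quantize the classical observables $x, p$: $x \to M_x$ =
multiplication by $x$, $p \to \frac{\hbar}{i}\frac{d}{dx}$, as before. Next think of $\bar x$ in (6.2.4) as an inner
product:

$$\bar x = \int_{\mathbb{R}} (M_x\psi)(x)\overline{\psi(x)}\,dx = \langle M_x\psi, \psi\rangle. \tag{6.2.6}$$

Then the quantum analogy of (6.2.6) for $p$ would be

$$\bar p \stackrel{\text{def}}{=} \int_{\mathbb{R}} \left(\frac{\hbar}{i}\frac{d}{dx}\psi\right)(x)\,\overline{\psi(x)}\,dx = \left\langle \frac{\hbar}{i}\frac{d}{dx}\psi, \psi\right\rangle. \tag{6.2.7}$$

For the example

$$\psi(x) = \begin{cases} \sqrt{\dfrac{2}{a}}\,\sin\dfrac{\pi}{a}x, & x \in [0,a], \\[2mm] 0, & x \in \mathbb{R} - [0,a], \end{cases} \tag{6.2.8}$$

where $a > 0$ (compare definition (6.1.2)),

$$\bar p = \frac{\hbar}{i}\,\frac{2\pi}{a^2}\int_0^a \cos\frac{\pi}{a}x\,\sin\frac{\pi}{a}x\;dx = 0; \tag{6.2.9}$$

cf. equation (6.1.16).

Note that by (6.1.27)

$$\int_{\mathbb{R}} \psi'(x)\overline{\psi(x)}\,dx = -\int_{\mathbb{R}} \psi(x)\overline{\psi'(x)}\,dx \tag{6.2.10}$$

(for $\psi$ sufficiently nice). Hence in (6.2.7)

$$\bar p = -\frac{\hbar}{i}\int_{\mathbb{R}} \psi(x)\overline{\psi'(x)}\,dx = \left\langle \psi, \frac{\hbar}{i}\frac{d}{dx}\psi\right\rangle = \text{the complex conjugate of } \bar p. \tag{6.2.11}$$

That is, $\bar p \in \mathbb{R}$; $\frac{\hbar}{i}\frac{d}{dx}$ is "self-adjoint."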

Given the notion of the mean value $\bar f$ of $f$ with respect to some distribution $\phi$
as in (6.2.3), one automatically has a corresponding notion of the variance $\sigma^2(f)$
of $f$ and the standard deviation $\sigma(f)$ of $f$. Namely

$$\sigma^2(f) = \overline{(f - \bar f)^2}, \qquad \sigma(f) = +\sqrt{\sigma^2(f)}. \tag{6.2.12}$$
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A formal meaning of the uncertainties Ax, Ap can now be given. For a sufficiently 
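nice wave function $\psi$, $\Delta x$, $\Delta p$ are the standard deviations

$$\Delta x = \sigma(x), \qquad \Delta p = \sigma(p). \tag{6.2.13}$$

Thus

$$\begin{aligned}
0 \leq (\Delta x)^2 \stackrel{\text{def}}{=} \overline{(x - \bar x)^2} &= \int_{\mathbb{R}} (x - \bar x)^2|\psi(x)|^2\,dx \\
&= \int_{\mathbb{R}} x^2|\psi(x)|^2\,dx - 2\bar x\int_{\mathbb{R}} x|\psi(x)|^2\,dx + (\bar x)^2\int_{\mathbb{R}} |\psi(x)|^2\,dx \\
&= \overline{x^2} - 2\bar x\,\bar x + (\bar x)^2 = \overline{x^2} - (\bar x)^2
\end{aligned} \tag{6.2.14}$$

and

$$(\Delta p)^2 \stackrel{\text{def}}{=} \int_{\mathbb{R}} \left[\left(\frac{\hbar}{i}\frac{d}{dx} - \bar p\right)^2\psi\right](x)\,\overline{\psi(x)}\,dx, \tag{6.2.15}$$

since the quantization rule $p \to \frac{\hbar}{i}\frac{d}{dx}$ leads to $(p - \bar p)^2 \to \left(\frac{\hbar}{i}\frac{d}{dx} - \bar p\right)^2$; cf. (3.9.2).
Choose $a = -\bar p/\hbar \in \mathbb{R}$ and note that

$$\frac{\hbar}{i}\frac{d}{dx} - \bar p = -\hbar\left(i\frac{d}{dx} - a\right) = -\hbar D_a, \tag{6.2.16}$$

$$\left(\frac{\hbar}{i}\frac{d}{dx} - \bar p\right)^2 = \hbar^2D_a^2, \tag{6.2.17}$$

$$(\Delta p)^2 = \hbar^2\int_{\mathbb{R}} (D_a^2\psi)(x)\,\overline{\psi(x)}\,dx = \hbar^2F_\psi(a). \tag{6.2.18}$$

By Theorem 6.1 we see that $(\Delta p)^2 \geq 0$, so that $\Delta p = +\sqrt{(\Delta p)^2}$ is indeed well defined,
and that for the choice $b = \bar x \in \mathbb{R}$ we have (using (6.1.33)) $(\Delta x)^2(\Delta p)^2/\hbar^2 \geq 1/4$,
since $\psi$ is normalized. Hence we have proved the following.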



Theorem 6.2 (Heisenberg's Uncertainty Principle). Let $\psi \in L^2(\mathbb{R}, dx)$ satisfy
the conditions of Theorem 6.1 and satisfy

$$\int_{\mathbb{R}} |\psi(x)|^2\,dx = 1.$$

Let $\bar x, \bar p \in \mathbb{R}$ be the mean values with respect to $\psi$ (or expectation values) defined
in (6.2.6) and (6.2.7), interpreted physically as the quantum mean values of the
position and the momentum $x$, $p$ of some particle. Define the uncertainties $\Delta x$, $\Delta p$
(with respect to $\psi$) by equations (6.2.13): $\Delta x = +\sqrt{(\Delta x)^2}$, $\Delta p = +\sqrt{(\Delta p)^2}$;
indeed $(\Delta x)^2, (\Delta p)^2 \geq 0$. Then $\Delta x$, $\Delta p$ satisfy the inequality

$$(\Delta x)(\Delta p) \geq \frac{\hbar}{2}. \tag{6.2.19}$$

We shall see that the continuity of $\psi'$ in Theorems 6.1, 6.2 need not be assumed.
In fact see (6.2.38) for a more general version of Theorem 6.2. Also see (6.2.33).

The Heisenberg uncertainty principle (6.2.19) is, by far, one of the most fundamental
principles in all of quantum theory, especially as this theory is founded on
laws of measurement. In some texts the uncertainty principle is stated (more or
less precisely) as follows: Two observables $O_1$, $O_2$ can be simultaneously (accurately)
measured only if their commutator $[O_1, O_2]$ vanishes; i.e., only if $O_1$ and
$O_2$ commute. If $[O_1, O_2] \neq 0$, then these measurements can be made only with an
uncertainty $(\Delta O_1)(\Delta O_2)$, the magnitude of which is dependent on the commutator
$[O_1, O_2]$. For $O_1 = M_x$ and $O_2 = \frac{\hbar}{i}\frac{d}{dx}$ as above, for example, $-[O_1, O_2] = \frac{\hbar}{i}\,1$,
as we have seen in Section 1.3 of Chapter 1. Thus in (6.2.19) we indeed see the
dependence of $(\Delta x)(\Delta p)$ on the value of the commutator $[M_x, \frac{\hbar}{i}\frac{d}{dx}]$ and that in
particular since $[M_x, \frac{\hbar}{i}\frac{d}{dx}] \neq 0$ the position and momentum of a particle, at the
quantum microscopic level, cannot be measured simultaneously with complete
accuracy, a statement which is in stark contrast to classical mechanics situations
where several dynamic quantities can be simultaneously measured with full accuracy.

Among the many philosophical implications of quantum mechanics one of the 
most striking is the nullification of classical determinism, wherein a system’s evo- 
lution is determined by initial data and acting forces. Newton’s deterministic laws 
predict future events from past knowledge. Given the positions and momenta of a 
system of mass particles at some initial time t = to and the forces of interaction 
of these particles, for example, the state of the system at any future time t can 
be determined exactly. Quantum mechanics, on the other hand, points to indeter- 
minacy as a feature of our universe. There are measurements of physical systems 
which are not determined by prior states of the system. In a scattering experiment, 
for example, it is not possible to know precisely where a particle will land after it 
meets the scattering object. We noted earlier that one cannot determine in advance 
at what point an electron passing through a small slit will strike a screen beyond 
that slit, although one can determine an interval (-xi,xi) on the screen where 
the probability of the electron striking therein is maximal; recall the discussion 
in Section 1.5 of Chapter 1. The uncertainty principle highlights the importance
of M. Born's basic postulate, which introduces indeterminacy into quantum mechanics
at the outset.

It is common to employ the Fourier transform in the formulation of Heisen- 
berg’s inequality, and thus of his uncertainty principle. We indicate how this is 
done, assuming some of the basic elements of Fourier analysis stated here with- 
out proof. In fact we give a second proof of the basic inequality (6.1.1). 
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There are various definitions (normalizations) of the Fourier transform, each 
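chosen for a particular convenience. For $\psi \in L^1(\mathbb{R}, dx)$ we will set

$$\hat\psi(x) \stackrel{\text{def}}{=} \int_{\mathbb{R}} e^{ixt}\psi(t)\,dt. \tag{6.2.20}$$

With this normalization $\psi$ is recovered from its Fourier transform $\hat\psi$ via the inversion
formula

$$\psi(x) = \frac{1}{2\pi}\int_{\mathbb{R}} e^{-ixt}\hat\psi(t)\,dt \tag{6.2.21}$$

for almost all $x \in \mathbb{R}$, assuming that $\hat\psi \in L^1$. Equation (6.2.21) holds at a point $x$
of continuity of $\psi$ (for $\hat\psi \in L^1$). A key result is Plancherel's formula.

Theorem 6.3. Suppose $\psi \in L^1(\mathbb{R}, dx) \cap L^2(\mathbb{R}, dx)$. Then $\hat\psi \in L^2(\mathbb{R}, dx)$ and

$$\int_{\mathbb{R}} |\hat\psi(x)|^2\,dx = 2\pi\int_{\mathbb{R}} |\psi(x)|^2\,dx; \quad \text{i.e., } \|\hat\psi\|_2^2 = 2\pi\|\psi\|_2^2.$$

The Riemann-Lebesgue Lemma states that $\hat\psi$ is a continuous function on $\mathbb{R}$
such that

$$\lim_{x\to\pm\infty} \hat\psi(x) = 0. \tag{6.2.22}$$

Theorem 6.4.

(i) Let $\psi \in L^1 = L^1(\mathbb{R}, dx)$ such that $x\psi \in L^1$. Then $\hat\psi$ is differentiable on $\mathbb{R}$
and $\hat\psi\,'(x) = i\,\widehat{x\psi}(x)$.

(ii) Let $\psi \in L^1$ be differentiable on $\mathbb{R}$ such that $\psi' \in L^1$. Then $\lim_{x\to\pm\infty}\psi(x) = 0$,
and $\widehat{\psi'}(x) = -ix\hat\psi(x)$.

(iii) For $\psi \in L^1$, $a, b \in \mathbb{R}$ define $f_\psi(x) = e^{iax}\psi(x+b)$ on $\mathbb{R}$. Then $f_\psi \in L^1$ and
$\hat f_\psi(x) = e^{-ib(x+a)}\hat\psi(x+a)$ [by the translation invariance of $dx$].

Assume now that $\psi : \mathbb{R} \to \mathbb{C}$ is differentiable on $\mathbb{R}$ such that $\psi, \psi', x\psi \in L^2$.
By Hölder's inequality $\psi'(x\bar\psi) \in L^1$. We integrate by parts:

$$\begin{aligned}
\int_{\mathbb{R}} \psi'(x)x\overline{\psi(x)}\,dx &= \lim_{c\to\infty}\int_{-c}^{c} \psi'(x)x\overline{\psi(x)}\,dx \\
&= \lim_{c\to\infty}\left\{\left[\psi(x)x\overline{\psi(x)}\right]_{-c}^{c} - \int_{-c}^{c} \left[\psi(x)\left(x\overline{\psi'(x)} + \overline{\psi(x)}\right)\right]dx\right\}.
\end{aligned} \tag{6.2.23}$$

Again by Hölder's inequality $\psi(x\bar\psi) \in L^1$; also $[\psi(x\bar\psi)]' = \psi'(x\bar\psi) + \psi(x\bar\psi') + \psi\bar\psi \in L^1$
(since $\psi(x\bar\psi') = (x\psi)\bar\psi'$). Therefore, by (ii) of Theorem 6.4,
$\lim_{x\to\pm\infty}\psi(x)x\overline{\psi(x)} = 0$; hence by (6.2.23)

$$\int_{\mathbb{R}} \psi'(x)x\overline{\psi(x)}\,dx = -\int_{\mathbb{R}} \psi(x)x\overline{\psi'(x)}\,dx - \int_{\mathbb{R}} |\psi(x)|^2\,dx. \tag{6.2.24}$$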
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That is. 



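$$\int_{\mathbb{R}} |\psi(x)|^2\,dx = -2\,\mathrm{Re}\int_{\mathbb{R}} \psi'(x)x\overline{\psi(x)}\,dx, \tag{6.2.25}$$

which with the Cauchy-Schwarz inequality gives

$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2 = 4\left(\mathrm{Re}\,\langle\psi', x\psi\rangle\right)^2 \leq 4|\langle\psi', x\psi\rangle|^2 \leq 4\|\psi'\|_2^2\,\|x\psi\|_2^2 = 4\int_{\mathbb{R}} |\psi'(x)|^2\,dx \cdot \int_{\mathbb{R}} x^2|\psi(x)|^2\,dx. \tag{6.2.26}$$

By considering $f_\psi(x) \stackrel{\text{def}}{=} e^{iax}\psi(x+b)$ for $a, b \in \mathbb{R}$ we have already noted that the
inequality (6.2.26) implies more generally that

$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2 \leq 4\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\int_{\mathbb{R}} \left[|\psi'(x)|^2 - 2ia\psi'(x)\overline{\psi(x)} + a^2|\psi(x)|^2\right]dx, \tag{6.2.27}$$

which is Heisenberg's inequality (6.1.1) (without a continuity assumption on $\psi'$).
Here again the second integral on the right-hand side of (6.2.27) is

$$\int_{\mathbb{R}} |f_\psi'(x)|^2\,dx \geq 0. \tag{6.2.28}$$

Assume in addition that $\psi, \psi' \in L^1$: $\psi, \psi' \in L^1 \cap L^2$. Then $f_\psi'(x) = e^{iax}\psi'(x+b) + \psi(x+b)e^{iax}ia$
$\implies f_\psi' \in L^1 \cap L^2$. By Plancherel's theorem, Theorem 6.3,
$\widehat{f_\psi'}, \hat\psi \in L^2$ and

$$\|\widehat{f_\psi'}\|_2^2 = 2\pi\|f_\psi'\|_2^2, \qquad \|\hat\psi\|_2^2 = 2\pi\|\psi\|_2^2. \tag{6.2.29}$$

By (ii), (iii) of Theorem 6.4 $\widehat{f_\psi'}(x) = -ix\hat f_\psi(x) = -ixe^{-ib(x+a)}\hat\psi(x+a) \implies$

$$\|\widehat{f_\psi'}\|_2^2 = \int_{\mathbb{R}} x^2|\hat\psi(x+a)|^2\,dx = \int_{\mathbb{R}} (x-a)^2|\hat\psi(x)|^2\,dx, \tag{6.2.30}$$

so that

$$\int_{\mathbb{R}} \left[|\psi'(x)|^2 - 2ia\psi'(x)\overline{\psi(x)} + a^2|\psi(x)|^2\right]dx = \frac{1}{2\pi}\int_{\mathbb{R}} (x-a)^2|\hat\psi(x)|^2\,dx, \tag{6.2.31}$$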
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which allows us to express the Heisenberg inequality (6.1.1) as 



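$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]^2 \leq \frac{4}{2\pi}\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} (x-a)^2|\hat\psi(x)|^2\,dx\right], \tag{6.2.32}$$

or more symmetrically as

$$\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\hat\psi(x)|^2\,dx\right] \leq 4\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} (x-a)^2|\hat\psi(x)|^2\,dx\right] \tag{6.2.33}$$

by (6.2.29). The validity of (6.2.33) has been established for $\psi : \mathbb{R} \to \mathbb{C}$ differentiable
which satisfies the conditions $\psi, \psi' \in L^1(\mathbb{R}, dx) \cap L^2(\mathbb{R}, dx)$, $x\psi \in
L^2(\mathbb{R}, dx)$, though it could be established for more general $\psi$.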

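We consider also the dilation of the Fourier transform

$$\psi_\hbar(p) \stackrel{\text{def}}{=} \frac{1}{\sqrt{2\pi}}\int_{\mathbb{R}} e^{ipx/\hbar}\psi(x)\,dx = \frac{1}{\sqrt{2\pi}}\,\hat\psi\left(\frac{p}{\hbar}\right) \tag{6.2.34}$$

for $\psi \in L^1$, $p \in \mathbb{R}$. If $\psi \in L^1 \cap L^2$, then by Theorem 6.3, $\psi_\hbar \in L^2$ and

$$\frac{1}{\hbar}\int_{\mathbb{R}} |\psi_\hbar(p)|^2\,dp = \|\psi\|_2^2. \tag{6.2.35}$$

Similarly

$$\int_{\mathbb{R}} (p-a)^2|\psi_\hbar(p)|^2\,dp = \frac{1}{2\pi}\int_{\mathbb{R}} (p-a)^2\left|\hat\psi\left(\frac{p}{\hbar}\right)\right|^2 dp = \frac{\hbar}{2\pi}\int_{\mathbb{R}} (p\hbar - a)^2|\hat\psi(p)|^2\,dp = \frac{\hbar^3}{2\pi}\int_{\mathbb{R}} \left(p - \frac{a}{\hbar}\right)^2|\hat\psi(p)|^2\,dp. \tag{6.2.36}$$

By equations (6.2.35), (6.2.36) and the inequality (6.2.33)

$$\begin{aligned}
4\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} (p-a)^2|\psi_\hbar(p)|^2\,dp\right]
&= 4\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\frac{\hbar^3}{2\pi}\left[\int_{\mathbb{R}} \left(p - \frac{a}{\hbar}\right)^2|\hat\psi(p)|^2\,dp\right] \\
&\geq \frac{\hbar^3}{2\pi}\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\hat\psi(p)|^2\,dp\right] \\
&= \hbar^2\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\psi_\hbar(p)|^2\,dp\right],
\end{aligned} \tag{6.2.37}$$

or

$$\frac{\left[\int_{\mathbb{R}} (x-b)^2|\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} (p-a)^2|\psi_\hbar(p)|^2\,dp\right]}{\left[\int_{\mathbb{R}} |\psi(x)|^2\,dx\right]\left[\int_{\mathbb{R}} |\psi_\hbar(p)|^2\,dp\right]} \geq \frac{\hbar^2}{4}, \tag{6.2.38}$$

which is a slightly more general version of the Heisenberg uncertainty principle.
Equation (6.2.38) is established for $\psi \in L^1(\mathbb{R}, dx) \cap L^2(\mathbb{R}, dx)$ differentiable such
that $\psi' \in L^1(\mathbb{R}, dx) \cap L^2(\mathbb{R}, dx)$, $x\psi \in L^2(\mathbb{R}, dx)$.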



6.3 Example 

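Let $\psi_n$ be the normalized wave function of Chapter 4 for the harmonic oscillator
of frequency $\nu$:

$$\psi_n(x) = c_n\exp\left(-\frac{b}{2}x^2\right)H_n(\sqrt{b}\,x) \quad \text{for } c_n = (-1)^n\left(\frac{b}{\pi}\right)^{1/4}\frac{1}{\sqrt{2^n\,n!}}, \quad b = \frac{2\pi m\nu}{\hbar}, \tag{6.3.1}$$

$n = 0, 1, 2, 3, \ldots$; see definition (4.8.5) there. By (4.8.7), $\psi_n(-x) = (-1)^n\psi_n(x)$
$\implies |\psi_n(-x)|^2 = |\psi_n(x)|^2 \implies$

$$\bar x = \int_{\mathbb{R}} x|\psi_n(x)|^2\,dx = 0 \tag{6.3.2}$$

as the integrand is an odd function. To compute $\bar p$, consider

$$\begin{aligned}
\psi_n'(x) &= c_n\exp\left(-\frac{b}{2}x^2\right)H_n'(\sqrt{b}\,x)\sqrt{b} - bx\,c_n\exp\left(-\frac{b}{2}x^2\right)H_n(\sqrt{b}\,x) \\
&= c_n\exp\left(-\frac{b}{2}x^2\right)\left[-H_{n+1}(\sqrt{b}\,x) + 2\sqrt{b}\,x\,H_n(\sqrt{b}\,x)\right]\sqrt{b} - bx\,\psi_n(x) \quad (\text{by Corollary 4.2}) \\
&= c_n\exp\left(-\frac{b}{2}x^2\right)\left[-\sqrt{b}\,H_{n+1}(\sqrt{b}\,x)\right] + bx\,\psi_n(x) \\
&= -\frac{c_n\sqrt{b}}{c_{n+1}}\,\psi_{n+1}(x) + bx\,\psi_n(x).
\end{aligned} \tag{6.3.3}$$

By wave function orthogonality (4.8.6) and equation (6.3.3)

$$\bar p = -\hbar i\int_{\mathbb{R}} \psi_n'(x)\psi_n(x)\,dx = -\hbar i\int_{\mathbb{R}} bx\,\psi_n(x)\psi_n(x)\,dx = -\hbar i\,b\,\bar x = 0 \tag{6.3.4}$$

by (6.3.2).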
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Then by definition (6.2.15) 



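$$(\Delta p)^2 = \int_{\mathbb{R}} \left[\left(\frac{\hbar}{i}\frac{d}{dx}\right)^2\psi_n\right](x)\,\psi_n(x)\,dx = -\hbar^2\int_{\mathbb{R}} \psi_n''(x)\psi_n(x)\,dx. \tag{6.3.5}$$

But $\psi_n$ is a solution of Schrödinger's equation

$$\frac{d^2\psi}{dx^2} + \frac{2m}{\hbar^2}\left(E - 2\pi^2\nu^2mx^2\right)\psi = 0 \tag{6.3.6}$$

for $E = E_n = 2\pi\left(n + \frac{1}{2}\right)\nu\hbar$, as we know. Thus, since $\psi_n$ is normalized, one has by
Theorem 4.11

$$\begin{aligned}
(\Delta p)^2 &= 2m\int_{\mathbb{R}} \left(E_n - 2\pi^2\nu^2mx^2\right)\psi_n(x)\psi_n(x)\,dx \\
&= 2mE_n - 4\pi^2\nu^2m^2\int_{\mathbb{R}} x^2\psi_n^2(x)\,dx \\
&= 2mE_n - 4\pi^2\nu^2m^2\,\frac{E_n}{4\pi^2\nu^2m} \\
&= 2mE_n - mE_n \\
&= mE_n.
\end{aligned} \tag{6.3.7}$$

Again since $\bar x = 0$

$$(\Delta x)^2 = \int_{\mathbb{R}} (x - \bar x)^2|\psi_n(x)|^2\,dx \tag{6.3.8}$$

$$= \int_{\mathbb{R}} x^2\psi_n^2(x)\,dx = \frac{E_n}{4\pi^2\nu^2m}, \tag{6.3.9}$$

as in (6.3.7). We can conclude that

$$(\Delta x)(\Delta p) = \frac{\sqrt{E_n}}{2\pi\nu\sqrt{m}}\sqrt{mE_n} = \frac{E_n}{2\pi\nu} = \frac{\hbar}{2}(2n+1), \tag{6.3.10}$$

which verifies Heisenberg's uncertainty principle, $(\Delta x)(\Delta p) \geq \frac{\hbar}{2}$, for every quantum
state $\psi_n$ of the harmonic oscillator. Moreover we see that equality $(\Delta x)(\Delta p) = \frac{\hbar}{2}$
is attained $\iff n = 0$; i.e., only for the ground state $\psi_0(x) = \left(\frac{b}{\pi}\right)^{1/4}\exp\left(-\frac{b}{2}x^2\right)$
is equality attained.

While we are at it, we can also compute the mean value $\bar V$ of the potential
energy $V$ for any quantum state $\psi_n$. According to definition (6.2.3)

$$\bar V = \int_{\mathbb{R}} V(x)|\psi_n(x)|^2\,dx \tag{6.3.11}$$

$$= 2\pi^2\nu^2m\int_{\mathbb{R}} x^2\psi_n^2(x)\,dx = \frac{E_n}{2} \tag{6.3.12}$$

by (6.3.9). That is, the mean value of the potential energy of a state $\psi_n$ is half the
quantum energy of that state.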
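The identity $(\Delta x)(\Delta p) = \frac{\hbar}{2}(2n+1)$ can be checked numerically for the first few states. The sketch below is illustrative and rests on assumptions not fixed by the text: units with $\hbar = 1$ and $b = 1$, the standard Hermite recurrence $H_{k+1}(x) = 2xH_k(x) - 2kH_{k-1}(x)$, the normalization $\pi^{-1/4}(2^n n!)^{-1/2}$ (the sign of $c_n$ does not affect the result), a midpoint rule on a truncated interval, and a central difference for $\psi_n'$. It uses $(\Delta p)^2 = \int|\psi_n'|^2\,dx$, which follows from (6.3.5) by an integration by parts.

```python
import math


def hermite(n: int, x: float) -> float:
    """H_n(x) from the recurrence H_{k+1} = 2x H_k - 2k H_{k-1}, H_0 = 1, H_1 = 2x."""
    h_prev, h = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h


def psi(n: int, x: float) -> float:
    """Normalized oscillator wave function with b = 1 (assumed units)."""
    c = math.pi ** -0.25 / math.sqrt(2.0 ** n * math.factorial(n))
    return c * math.exp(-0.5 * x * x) * hermite(n, x)


def uncertainty_product(n: int, half_width: float = 10.0, steps: int = 4000) -> float:
    """Delta x * Delta p for psi_n with hbar = 1, by midpoint quadrature."""
    h = 2.0 * half_width / steps
    d = 1.0e-5  # step of the central difference used for psi_n'
    x2 = p2 = 0.0
    for k in range(steps):
        x = -half_width + (k + 0.5) * h
        x2 += x * x * psi(n, x) ** 2                          # integrand of (6.3.9)
        dpsi = (psi(n, x + d) - psi(n, x - d)) / (2.0 * d)
        p2 += dpsi ** 2                                       # integrand of int |psi'|^2
    return math.sqrt(x2 * h) * math.sqrt(p2 * h)


for n in range(4):
    print(n, uncertainty_product(n), n + 0.5)
```

In these units the product should come out close to $n + \frac{1}{2}$, with the minimum $\frac{1}{2}$ attained only at $n = 0$.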
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Group Representations 
and Selection Rules 



7.1 Group Representations 

One of the main efforts of Chapter 5 was to determine explicitly the normalized
wave functions $\psi_{nlm}$ of a hydrogenic atom; see equation (5.2.12). Here $n \in
\{1, 2, 3, \ldots\}$, $l \in \{0, 1, 2, \ldots, n-1\}$, $m \in \{-l, -l+1, \ldots, -1, 0, 1, 2, \ldots, l\}$ are
the total, azimuthal, and magnetic quantum numbers, respectively. The triples
$(n, l, m) = (5, 2, -2), (5, 4, -2)$ define, for example, two quantum states with
the same energy $E^{(5)}$ (by (5.1.29)) of the hydrogenic atom. One could ask the
following question: Is it possible for the atom to proceed directly from the state
$(5, 2, -2)$ to the state $(5, 4, -2)$: is the transition $(5, 2, -2) \to (5, 4, -2)$ always
physically possible? The answer is no, as we shall see in this chapter. Moreover
the answer comes by way of some beautiful applications of group theory. This
example, of whether a certain transition is allowable or forbidden, illustrates one
of many selection rules in quantum mechanics. The answer no just asserted turns
out to depend on whether or not a certain integral (depending on the quantum
numbers $(5, 2, -2)$, $(5, 4, -2)$) vanishes.

In this chapter therefore we shall consider very broadly, and abstractly, a selection
rule to be some rule or means by which one can ascertain in advance that
certain integrals vanish. From this viewpoint the familiar statement

$$\int_{-a}^{a} f(x)\,dx = 0 \ \text{ for } f \text{ continuous on } [-a, a] \text{ with } f(-x) = -f(x) \tag{7.1.1}$$

would be an example of a selection rule. Our goal is to formulate, purely in
terms of group theory, a general abstract selection rule, which indeed will have
concrete applications. This we do in the main theorem, Theorem 7.2. Also see







in fact Theorem 7.6. In Section 7.7 we specialize our discussion and consider, 
more concretely, spectroscopic selection rules. To prepare for Theorem 7.2 we re- 
view some elementary definitions and facts about group representation theory — 
material available in many standard texts, collected here for the reader’s conve- 
nience. 
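The parity rule (7.1.1) is easy to check numerically. The following is a minimal sketch, not part of the text: the integrands `odd_example` and `even_example`, the interval half-width `a = 2`, and the midpoint-rule helper `integrate` are all illustrative assumptions.

```python
import math

def integrate(f, a, b, n=1000):
    """Composite midpoint rule on [a, b] with n equal subintervals."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))

def odd_example(x):
    # x^3 cos(x) is continuous and odd: f(-x) = -f(x)
    return x ** 3 * math.cos(x)

def even_example(x):
    # x^2 is even; its integral over [-a, a] is 2 a^3 / 3, not zero
    return x ** 2

a = 2.0
odd_integral = integrate(odd_example, -a, a)
even_integral = integrate(even_example, -a, a)
print(abs(odd_integral) < 1e-9, abs(even_integral - 2 * a ** 3 / 3) < 1e-4)
```

The odd integrand is recognized as having vanishing integral from its symmetry alone, which is the sense in which (7.1.1) is a selection rule; the even integrand shows that no such conclusion holds without the symmetry.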

Let $G$ denote a fixed abstract group. A representation of $G$ on a finite-dimensional vector space $V$ over some field $F$ is a group homomorphism $\pi : G \to \mathrm{Gl}(V)$ from $G$ to the group of non-singular linear operators $\mathrm{Gl}(V)$ on $V$. Thus $\pi(x_1x_2) = \pi(x_1) \circ \pi(x_2)$ for $x_1, x_2 \in G$. In practice $G$ will be a topological group (even a Lie group) and $V$ will be defined over $F = \mathbb{R}$ or $\mathbb{C}$, in which case $\pi$ will be required to be continuous: if $V^*$ denotes the dual space of $V$ then for each pair $(v, f) \in V \times V^*$ the map $\pi_{v,f} : G \to F$ given by

$$\pi_{v,f}(x) = f(\pi(x)v) \ \text{ for } x \in G \tag{7.1.2}$$

is continuous. The degree or dimension of $\pi$ is the dimension of $V$ over $F$. Some examples of continuous representations are as follows.

1. Take $G = \mathbb{R}$, $V = \mathbb{R}^2$ and define $\pi : \mathbb{R} \to \mathrm{Gl}(\mathbb{R}^2)$ by $\pi(x)(a,b) = (a + bx, b)$ for $x \in G$, $(a,b) \in \mathbb{R}^2$. Clearly $\pi(x) : \mathbb{R}^2 \to \mathbb{R}^2$ is a non-singular linear map such that $\pi(x_1 + x_2) = \pi(x_1) \circ \pi(x_2)$ for $x_1, x_2 \in \mathbb{R}$. If $f \in (\mathbb{R}^2)^*$, then for $e_1 = (1,0)$, $e_2 = (0,1)$, $(a_1,b_1) \in \mathbb{R}^2$ one has $f((a_1,b_1)) = a_1 f(e_1) + b_1 f(e_2)$. Therefore $\pi_{(a,b),f} : \mathbb{R} \to \mathbb{R}$ in equation (7.1.2) is given by $\pi_{(a,b),f}(x) \overset{\text{def}}{=} f(a + bx, b) = (a + bx)f(e_1) + bf(e_2)$, which is continuous in $x$. Thus $\pi$ is a continuous 2-dimensional representation of $\mathbb{R}$ on $\mathbb{R}^2$.
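The homomorphism property in example 1 can be spot-checked directly. This is a small sketch under the assumption that the operator is $\pi(x)(a,b) = (a + bx, b)$; the helper names `pi` and `compose` and the sample values are hypothetical.

```python
def pi(x):
    """Return the operator pi(x) on R^2 as a function (a, b) -> (a + b*x, b)."""
    def op(v):
        a, b = v
        return (a + b * x, b)
    return op

def compose(f, g):
    """Composition f o g of two operators on R^2."""
    return lambda v: f(g(v))

x1, x2 = 3, -5          # integers keep the arithmetic exact
v = (2, 7)
lhs = pi(x1 + x2)(v)                   # pi(x1 + x2) v
rhs = compose(pi(x1), pi(x2))(v)       # pi(x1) o pi(x2) v
print(lhs == rhs, pi(0)(v) == v)
```

A check on sample points is of course not a proof; it only illustrates the identity $\pi(x_1 + x_2) = \pi(x_1) \circ \pi(x_2)$ and the fact that $\pi(0)$ is the identity.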



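2. Take $G = \mathbb{R}/2\pi\mathbb{Z}$ (= the circle group $S^1$) and $V = \mathbb{C}$; $\mathbb{Z}$ = the ring of integers. For each fixed $n \in \mathbb{Z}$ define $\pi = \pi^{(n)} : \mathbb{R}/2\pi\mathbb{Z} \to \mathrm{Gl}(\mathbb{C})$ by $\pi^{(n)}(\bar{x})z = e^{inx}z$ for $z \in \mathbb{C}$, $\bar{x}$ = the class $x + 2\pi\mathbb{Z}$ of $x \in \mathbb{R}$. The non-singular linear operator $\pi^{(n)}(\bar{x}) : \mathbb{C} \to \mathbb{C}$ depends only on the class $\bar{x}$ and not the representative $x$ of the class, since $e^{2\pi i n m} = 1$ $\forall\, m \in \mathbb{Z}$. For $(z, f) \in \mathbb{C} \times \mathbb{C}^*$ the map $\pi^{(n)}_{z,f} : \mathbb{R}/2\pi\mathbb{Z} \to \mathbb{C}$ in equation (7.1.2) is given by $\pi^{(n)}_{z,f}(\bar{x}) \overset{\text{def}}{=} f(e^{inx}z) = e^{inx}f(z)$, which is continuous, since the composition $\mathbb{R} \to \mathbb{R}/2\pi\mathbb{Z} \to \mathbb{C}$ ($x \mapsto \bar{x} \mapsto e^{inx}f(z)$) is continuous. Each $n \in \mathbb{Z}$ therefore gives rise to a continuous 1-dimensional representation $\pi^{(n)}$ of $G = \mathbb{R}/2\pi\mathbb{Z}$ on $\mathbb{C}$. With respect to the standard inner product $\langle\ ,\ \rangle$ on $\mathbb{C}$ (i.e., $\langle z_1, z_2\rangle = z_1\bar{z}_2$) each $\pi^{(n)}$ is unitary: $\langle \pi^{(n)}(\bar{x})z_1, \pi^{(n)}(\bar{x})z_2\rangle = \langle z_1, z_2\rangle$ for $\bar{x} \in G$, $z_1, z_2 \in \mathbb{C}$. It is not hard to show that the family $\{\pi^{(n)}\}_{n \in \mathbb{Z}}$ gives all continuous, 1-dimensional, unitary representations of $G$, up to unitary equivalence. Namely if $\pi : G \to \mathrm{Gl}(V)$ is any continuous, 1-dimensional unitary representation of $G$, then for some $n \in \mathbb{Z}$ there is a unitary transformation $U : V \to \mathbb{C}$ such that the diagram

$$\begin{array}{ccc} V & \xrightarrow{\ \pi(\bar{x})\ } & V \\ {\scriptstyle U}\downarrow & & \downarrow{\scriptstyle U} \\ \mathbb{C} & \xrightarrow{\ \pi^{(n)}(\bar{x})\ } & \mathbb{C} \end{array}$$

commutes $\forall\, \bar{x} \in G$.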
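The three properties used in example 2 (independence of the representative, the homomorphism property, and unitarity) can be illustrated numerically. This sketch assumes the defining formula $\pi^{(n)}(\bar{x})z = e^{inx}z$; the function names and sample values are hypothetical.

```python
import cmath
import math

def pi_n(n, x):
    """The scalar e^{inx} by which pi^{(n)} acts on C at the class of x."""
    return cmath.exp(1j * n * x)

def close(z, w, tol=1e-9):
    """Approximate equality of complex numbers."""
    return abs(z - w) < tol

n, x, y, m = 3, 0.7, 1.9, -4
well_defined = close(pi_n(n, x + 2 * math.pi * m), pi_n(n, x))
homomorphism = close(pi_n(n, x + y), pi_n(n, x) * pi_n(n, y))
unitary = close(abs(pi_n(n, x)), 1.0)
print(well_defined, homomorphism, unitary)
```

Unitarity here reduces to $|e^{inx}| = 1$, which is why every $\pi^{(n)}$ preserves the standard inner product on $\mathbb{C}$.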

3. Let $G = O(n)$ be the topological group of real $n \times n$ orthogonal matrices: $AA^t = 1$ for $1$ = the $n \times n$ identity matrix, where $A^t$ is the transpose of $A \in G$. For $V = \mathbb{R}^n$ or $\mathbb{C}^n$ let $\pi : G \to \mathrm{Gl}(V)$ be the canonical map given by $\pi(A)(x_1, \ldots, x_n) = (x_1, \ldots, x_n)A^t$ for $x = (x_1, \ldots, x_n) \in V$. Then $\pi$ is a representation of $G$ on $V$. Let $e_i = (0, \ldots, 1, 0, \ldots, 0) \in V$ where 1 occurs in the $i$th slot. Similar to example 1, one has $f(x) = \sum_{i=1}^n x_i f(e_i)$ for any $f \in V^*$ so that for $v = (v_1, \ldots, v_n) \in V$ the map $\pi_{v,f} : O(n) \to \mathbb{R}$ or $\mathbb{C}$ in equation (7.1.2) is given by $\pi_{v,f}(A) = \sum_{i=1}^n x_i f(e_i)$ where $x = vA^t$. That is, $x_i = \sum_{j=1}^n v_j a_{ij}$ for $A = [a_{ij}]$ $\Longrightarrow$ $\pi_{v,f}(A) = \sum_{i,j=1}^n v_j f(e_i) a_{ij}$ is a continuous function of $A$ $\Longrightarrow$ $\pi$ is a continuous $n$-dimensional representation. In particular if $SO(n)$ is the subgroup of $O(n)$ of matrices $A$ with $\det A = 1$ ($SO(n)$ is called the special orthogonal group), then by restriction we obtain the continuous representation $\pi : SO(n) \to \mathrm{Gl}(V)$ of $SO(n)$ on $V$.

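4. Let $\pi$ be the continuous representation of $G = O(3)$ on $\mathbb{R}^3$ given in example 3. Given any function $\phi : \mathbb{R}^3 \to \mathbb{R}$ or $\mathbb{C}$ define $A \cdot \phi : \mathbb{R}^3 \to \mathbb{R}$ or $\mathbb{C}$ for $A \in G$ by

$$(A \cdot \phi)(x, y, z) \overset{\text{def}}{=} \phi(\pi(A^{-1})(x, y, z)) = \phi((x, y, z)A), \tag{7.1.3}$$

since $A^{-1} = A^t$. In particular for the projections $p_1, p_2, p_3 : \mathbb{R}^3 \to \mathbb{R}$ onto the first, second, third coordinates respectively, $A = [a_{ij}]$ $\Longrightarrow$ $(x, y, z)A = (a_{11}x + a_{21}y + a_{31}z,\ a_{12}x + a_{22}y + a_{32}z,\ a_{13}x + a_{23}y + a_{33}z)$ $\Longrightarrow$ $(A \cdot p_1)(x,y,z) = a_{11}x + a_{21}y + a_{31}z = a_{11}p_1(x,y,z) + a_{21}p_2(x,y,z) + a_{31}p_3(x,y,z)$ $\Longrightarrow$

$$A \cdot p_1 = a_{11}p_1 + a_{21}p_2 + a_{31}p_3. \tag{7.1.4}$$

Similarly,

$$A \cdot p_2 = a_{12}p_1 + a_{22}p_2 + a_{32}p_3 \quad\text{and} \tag{7.1.5}$$

$$A \cdot p_3 = a_{13}p_1 + a_{23}p_2 + a_{33}p_3. \tag{7.1.6}$$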

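Let $O(3) \cdot p_j$, $\langle p_1, p_2, p_3\rangle$ denote the complex subspaces of $C(\mathbb{R}^3, \mathbb{C})$ (the space of $\mathbb{C}$-valued continuous functions on $\mathbb{R}^3$) spanned by the vectors $\{A \cdot p_j \mid A \in O(3)\}$, $\{p_1, p_2, p_3\}$ respectively; $1 \le j \le 3$. By equations (7.1.4), (7.1.5), (7.1.6) we have

$$O(3) \cdot p_j \subset \langle p_1, p_2, p_3\rangle. \tag{7.1.7}$$

Consider

$$A_1 \overset{\text{def}}{=} \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix},\quad A_2 \overset{\text{def}}{=} \begin{bmatrix} 0 & 0 & -1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix},\quad A_3 \overset{\text{def}}{=} \begin{bmatrix} 1 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{bmatrix}. \tag{7.1.8}$$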

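Note in fact that $A_1, A_2, A_3 \in SO(3)$. By (7.1.4), $A_1 \cdot p_1 = p_2$, $A_2 \cdot p_1 = p_3$, $1 \cdot p_1 = p_1$ $\Longrightarrow$ $p_1, p_2, p_3 \in SO(3) \cdot p_1$ $\Longrightarrow$ $\langle p_1, p_2, p_3\rangle \subset SO(3) \cdot p_1$ $\Longrightarrow$ $\langle p_1, p_2, p_3\rangle = SO(3) \cdot p_1 = O(3) \cdot p_1$ by (7.1.7). Similarly $A_3 \cdot p_2 = p_3$, $A_1 \cdot p_2 = -p_1$, $1 \cdot p_2 = p_2$ $\Longrightarrow$ $p_1, p_2, p_3 \in SO(3) \cdot p_2$ $\Longrightarrow$ $\langle p_1, p_2, p_3\rangle = SO(3) \cdot p_2 = O(3) \cdot p_2$, and $A_3 \cdot p_3 = -p_2$, $A_2 \cdot p_3 = -p_1$, $1 \cdot p_3 = p_3$ $\Longrightarrow$ $p_1, p_2, p_3 \in SO(3) \cdot p_3$ $\Longrightarrow$ $\langle p_1, p_2, p_3\rangle = SO(3) \cdot p_3 = O(3) \cdot p_3$. That is, $\langle p_1, p_2, p_3\rangle = SO(3) \cdot p_1 = SO(3) \cdot p_2 = SO(3) \cdot p_3 = O(3) \cdot p_1 = O(3) \cdot p_2 = O(3) \cdot p_3$, with $\dim_{\mathbb{C}} \langle p_1, p_2, p_3\rangle = 3$ as $p_1, p_2, p_3$ are $\mathbb{C}$-linearly independent. For $A \in O(3)$ and $\phi : \mathbb{R}^3 \to \mathbb{C}$ we have defined $A \cdot \phi : \mathbb{R}^3 \to \mathbb{C}$ by (7.1.3).

If $\phi \in \langle p_1, p_2, p_3\rangle$, then $A \cdot \phi \in \langle p_1, p_2, p_3\rangle$ by (7.1.4), (7.1.5), (7.1.6). Thus we have a linear operator $\pi^{(3)}(A) : \langle p_1, p_2, p_3\rangle \to \langle p_1, p_2, p_3\rangle$ given by $\pi^{(3)}(A)\phi = A \cdot \phi$. By (7.1.4), (7.1.5), (7.1.6) the matrix of $\pi^{(3)}(A)$ relative to the ordered basis $\{p_1, p_2, p_3\}$ is $A$. Hence $\pi^{(3)}(A)$ is non-singular. Indeed the map $\pi^{(3)} : O(3) \to \mathrm{Gl}(\langle p_1, p_2, p_3\rangle)$ is a representation of $O(3)$ on $\langle p_1, p_2, p_3\rangle$. We note however that the two representations $\pi$, $\pi^{(3)}$ of $O(3)$ are equivalent: $\exists$ a linear isomorphism $T$ of $\langle p_1, p_2, p_3\rangle$ onto $\mathbb{C}^3$ such that the diagram

$$\begin{array}{ccc} \langle p_1, p_2, p_3\rangle & \xrightarrow{\ \pi^{(3)}(A)\ } & \langle p_1, p_2, p_3\rangle \\ {\scriptstyle T}\downarrow & & \downarrow{\scriptstyle T} \\ \mathbb{C}^3 & \xrightarrow{\ \pi(A)\ } & \mathbb{C}^3 \end{array}$$

commutes $\forall\, A \in G = O(3)$; compare the similar situation in example 2. In particular $\pi^{(3)}$ is also continuous (although continuity is easy to check directly). $T$ is given by $Tp_j = e_j$ where $e_1 = (1,0,0)$, $e_2 = (0,1,0)$, $e_3 = (0,0,1)$. Indeed for $1 \le j \le 3$, $T\pi^{(3)}(A)p_j = T A \cdot p_j = T\sum_{i=1}^3 a_{ij}p_i$ (by (7.1.4), (7.1.5), (7.1.6)) $= \sum_{i=1}^3 a_{ij}e_i = (a_{1j}, a_{2j}, a_{3j})$. On the other hand $\pi(A)Tp_j = \pi(A)e_j = e_jA^t = (a_{1j}, a_{2j}, a_{3j})$, as desired.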
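The relations among $A_1, A_2, A_3$ and the projections used in example 4 can be verified mechanically. The sketch below assumes that the matrices of (7.1.8) are the three rotation matrices written out in the code, and that the action is $(A \cdot \phi)(x,y,z) = \phi((x,y,z)A)$ as in (7.1.3); all helper names are hypothetical.

```python
A1 = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]
A2 = [[0, 0, -1], [0, 1, 0], [1, 0, 0]]
A3 = [[1, 0, 0], [0, 0, -1], [0, 1, 0]]

def row_times_matrix(v, A):
    """Row vector v times the 3 x 3 matrix A."""
    return tuple(sum(v[i] * A[i][j] for i in range(3)) for j in range(3))

def act(A, phi):
    """(A . phi)(x, y, z) = phi((x, y, z) A), as in (7.1.3)."""
    return lambda v: phi(row_times_matrix(v, A))

# Coordinate projections p1, p2, p3 (stored with 0-based indices).
p = [lambda v, k=k: v[k] for k in range(3)]

def neg(phi):
    """The function -phi."""
    return lambda v: -phi(v)

def same_on_samples(phi, psi, samples=((2, 3, 5), (-1, 4, 7), (0, 0, 1))):
    """Compare two linear functions on three linearly independent vectors."""
    return all(phi(v) == psi(v) for v in samples)

checks = [
    same_on_samples(act(A1, p[0]), p[1]),        # A1 . p1 = p2
    same_on_samples(act(A2, p[0]), p[2]),        # A2 . p1 = p3
    same_on_samples(act(A3, p[1]), p[2]),        # A3 . p2 = p3
    same_on_samples(act(A1, p[1]), neg(p[0])),   # A1 . p2 = -p1
    same_on_samples(act(A3, p[2]), neg(p[1])),   # A3 . p3 = -p2
    same_on_samples(act(A2, p[2]), neg(p[0])),   # A2 . p3 = -p1
]
print(all(checks))
```

Because all functions involved are linear and the three sample vectors are linearly independent, agreement on the samples is equivalent to equality of the functions.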



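5. Let $p_1, p_2, p_3 : \mathbb{R}^3 \to \mathbb{R}$ be the projections of example 4; each $p_j \in C^\infty(\mathbb{R}^3)$. Given an integer $l \ge 0$ let $P^{(l)}$ be the space of polynomial functions $p$ in $x, y, z$ that are homogeneous of degree $l$: $p = \sum_I c_I p^I$ where $c_I \in \mathbb{C}$, $I = (a, b, c)$ for integers $a, b, c \ge 0$, with $p^I \overset{\text{def}}{=} p_1^a p_2^b p_3^c$ and where, by the homogeneity condition $p(tx, ty, tz) = t^l p(x, y, z)$ for $t \in \mathbb{R}$, one has $a + b + c = l$ (by Euler's theorem, for example, which says that $x\,\partial p/\partial x + y\,\partial p/\partial y + z\,\partial p/\partial z = lp$). Conversely, if $a + b + c = l$ for each $I$, then $p$ is clearly homogeneous of degree $l$. Let

$$H^{(l)} = \{p \in P^{(l)} \mid \nabla^2 p = 0\} \tag{7.1.9}$$

where $\nabla^2$ is the Laplacian $\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$ on $\mathbb{R}^3$. By Theorem 4.13 in Appendix 4A the spherical harmonics $\{Y_l^m\}_{m=-l}^{l}$ are elements of $H^{(l)}$, for example. More generally if $\nabla^2 = \sum_{j=1}^n \frac{\partial^2}{\partial x_j^2}$ is the Laplacian on $\mathbb{R}^n$, then for $f : \mathbb{R}^n \to \mathbb{R}$ or $\mathbb{C}$ a function with continuous first and second partial derivatives one has (essentially by the chain rule)

$$\nabla^2(A \cdot f) = A \cdot \nabla^2 f \tag{7.1.10}$$

for $A \in O(n)$ where $(A \cdot f)(x) \overset{\text{def}}{=} f(xA)$ for $x \in \mathbb{R}^n$ as in (7.1.3). A proof of (7.1.10) is presented in Appendix 7C. In particular if $f$ is harmonic (i.e., $\nabla^2 f = 0$), then so is $A \cdot f$. Also by equations (7.1.4), (7.1.5), (7.1.6) in example 4 one has $A \cdot p \in P^{(l)}$ for $p \in P^{(l)}$, $A \in O(3)$. Namely for $A = [a_{ij}]$

$$A \cdot p = \sum_I c_I\, A \cdot p^I, \tag{7.1.11}$$

where for $I = (a, b, c)$ with $a + b + c = l$,

$$A \cdot p^I = (A \cdot p_1)^a (A \cdot p_2)^b (A \cdot p_3)^c = (a_{11}p_1 + a_{21}p_2 + a_{31}p_3)^a \cdot (a_{12}p_1 + a_{22}p_2 + a_{32}p_3)^b \cdot (a_{13}p_1 + a_{23}p_2 + a_{33}p_3)^c. \tag{7.1.12}$$

By the binomial theorem

$$(a_{11}p_1 + a_{21}p_2 + a_{31}p_3)^a = \sum_{0 \le r \le a}\ \sum_{0 \le s \le r} \binom{a}{r}\binom{r}{s}\, a_{11}^{s} a_{21}^{r-s} a_{31}^{a-r}\, p_1^{s} p_2^{r-s} p_3^{a-r}. \tag{7.1.13}$$

Similarly $(a_{12}p_1 + a_{22}p_2 + a_{32}p_3)^b$, $(a_{13}p_1 + a_{23}p_2 + a_{33}p_3)^c$ are linear combinations of terms $p_1^{u} p_2^{t-u} p_3^{b-t}$, $p_1^{w} p_2^{v-w} p_3^{c-v}$, respectively, where $0 \le u \le t \le b$, $0 \le w \le v \le c$, and where the coefficients are polynomial functions of the $a_{ij}$. Since

$$p_1^{s} p_2^{r-s} p_3^{a-r}\; p_1^{u} p_2^{t-u} p_3^{b-t}\; p_1^{w} p_2^{v-w} p_3^{c-v} = p_1^{s+u+w}\, p_2^{r-s+t-u+v-w}\, p_3^{a-r+b-t+c-v} \in P^{(l)}$$

(i.e., $s+u+w+r-s+t-u+v-w+a-r+b-t+c-v = a+b+c = l$) we see that by (7.1.12), (7.1.13) $A \cdot p \in P^{(l)}$ (which follows by simpler arguments), but we also see that if $\omega \in (P^{(l)})^*$, then the function $A \mapsto \omega(A \cdot p)$ is continuous on $O(3)$, and is in fact polynomial in the entries $a_{ij}$ of $A$. That is, the equation

$$\pi^{(l)}(A)p \overset{\text{def}}{=} A \cdot p, \quad (A, p) \in O(3) \times P^{(l)} \tag{7.1.14}$$

defines a continuous representation $\pi^{(l)}$ of $O(3)$ on $P^{(l)}$ which by (7.1.10) has $H^{(l)}$ as an invariant subspace: $\pi^{(l)}(A)H^{(l)} \subset H^{(l)}$ $\forall\, A \in O(3)$. We can therefore regard $\pi^{(l)}$ also as a continuous representation of $O(3)$ on $H^{(l)}$ and by restriction to $SO(3)$, $\pi^{(l)}$ is a continuous representation of $SO(3)$ on $H^{(l)}$.

We state without proof the following standard facts: $\dim P^{(l)} = (l + 2)(l + 1)/2$ and $\dim H^{(l)} = 2l + 1$; in fact $\{Y_l^m\}_{m=-l}^{l}$ is a $\mathbb{C}$-basis of $H^{(l)}$. The representation $\pi^{(l)}$ of $SO(3)$ on $H^{(l)}$ is irreducible: if $V \subset H^{(l)}$ is any subspace such that $\pi^{(l)}(A)V \subset V$ $\forall\, A \in SO(3)$ then $V$ = either $\{0\}$ or $H^{(l)}$. In particular $\pi^{(l)}$ is an irreducible representation of $O(3)$ on $H^{(l)}$. If $\pi$ is any continuous, irreducible representation of $SO(3)$ on some complex vector space $V$ then $\pi$ is equivalent to $\pi^{(l)}$ for some integer $l \ge 0$: As in examples 2 and 4 the diagram

$$\begin{array}{ccc} H^{(l)} & \xrightarrow{\ \pi^{(l)}(A)\ } & H^{(l)} \\ {\scriptstyle T}\downarrow & & \downarrow{\scriptstyle T} \\ V & \xrightarrow{\ \pi(A)\ } & V \end{array}$$

is commutative for some suitable linear isomorphism $T$. We remark that the representation $\pi$ of $SO(3)$ on $\mathbb{C}^3$, in example 3, is known to be irreducible. Since $\dim \pi = 3 = \dim \pi^{(l)} = 2l + 1$ $\Longrightarrow$ $l = 1$ we must therefore have that $\pi$ is equivalent to $\pi^{(1)}$!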
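The two dimension formulas quoted in example 5 can be confirmed by exact linear algebra for small $l$. The sketch below is an illustration, not the text's argument: it writes the Laplacian as a matrix from $P^{(l)}$ to $P^{(l-2)}$ in the monomial bases and computes the dimension of its kernel with rational arithmetic. The function names are hypothetical.

```python
from fractions import Fraction

def monomials(l):
    """Exponent triples (a, b, c) with a + b + c = l: a basis of P^(l)."""
    return [(a, b, l - a - b) for a in range(l + 1) for b in range(l + 1 - a)]

def laplacian_matrix(l):
    """Matrix of the Laplacian P^(l) -> P^(l-2) in the monomial bases."""
    src = monomials(l)
    dst = monomials(l - 2) if l >= 2 else []
    index = {m: k for k, m in enumerate(dst)}
    rows = [[Fraction(0)] * len(src) for _ in dst]
    for col, mono in enumerate(src):
        for axis in range(3):
            e = mono[axis]
            if e >= 2:
                # second derivative of t^e is e (e - 1) t^(e - 2)
                target = list(mono)
                target[axis] -= 2
                rows[index[tuple(target)]][col] += e * (e - 1)
    return rows

def rank(rows):
    """Rank by Gaussian elimination over the rationals (exact)."""
    rows = [r[:] for r in rows]
    rk = 0
    ncols = len(rows[0]) if rows else 0
    for c in range(ncols):
        pivot = next((r for r in range(rk, len(rows)) if rows[r][c] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for r in range(rk + 1, len(rows)):
            factor = rows[r][c] / rows[rk][c]
            if factor != 0:
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[rk])]
        rk += 1
        if rk == len(rows):
            break
    return rk

def dim_P(l):
    """Number of monomials of degree l in three variables."""
    return len(monomials(l))

def dim_H(l):
    """Dimension of the kernel of the Laplacian on P^(l)."""
    return dim_P(l) - rank(laplacian_matrix(l))

print([(l, dim_P(l), dim_H(l)) for l in range(5)])
```

For each $l$ tested, the kernel dimension agrees with $2l + 1$, consistent with the stated fact $\dim H^{(l)} = 2l + 1$.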

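6. By Corollary 5.1 the wave functions $\{\psi_{nlm}\}_{m=-l}^{l}$ are linearly independent over $\mathbb{C}$. The $\mathbb{C}$-span $S_{nl}$ in $L^2(\mathbb{R}^3, dx\,dy\,dz)$ of the $\{\psi_{nlm}\}_{m=-l}^{l}$ is therefore a $(2l + 1)$-dimensional complex vector space, which we claim moreover is $O(3)$-invariant. For $A \in O(3)$, $(x, y, z) \in \mathbb{R}^3$, $|(x, y, z)|^2 = x^2 + y^2 + z^2 = |(x, y, z)A|^2$. Write $\psi_{nlm}(x,y,z) = c_{lm}\, R_{nl}(|(x,y,z)|)\, Y_l^m(x,y,z)$, where $R_{nl}$ abbreviates the radial factor in (5.2.15) (the constant, exponential, and Laguerre terms, which depend on $(x,y,z)$ only through $|(x,y,z)|$). Then by (5.2.15) and (7.1.3)

$$(A \cdot \psi_{nlm})(x,y,z) \overset{\text{def}}{=} \psi_{nlm}((x,y,z)A) = c_{lm}\, R_{nl}(|(x,y,z)|)\, Y_l^m((x,y,z)A). \tag{7.1.15}$$

By example 5 we can write

$$Y_l^m((x, y, z)A) = (\pi^{(l)}(A)Y_l^m)(x, y, z) = \sum_{j=-l}^{l} c_{jm}(A)\, Y_l^j(x,y,z) \tag{7.1.16}$$

for suitable $c_{jm}(A) \in \mathbb{C}$. Then

$$(A \cdot \psi_{nlm})(x,y,z) = \sum_{j=-l}^{l} c_{jm}(A)\, c_{lm}\, R_{nl}(|(x,y,z)|)\, Y_l^j(x, y, z) = c_{lm} \sum_{j=-l}^{l} \frac{c_{jm}(A)}{c_{lj}}\, \psi_{nlj}(x,y,z) \tag{7.1.17}$$

$$\Longrightarrow\quad A \cdot \psi_{nlm} = c_{lm} \sum_{j=-l}^{l} \frac{c_{jm}(A)}{c_{lj}}\, \psi_{nlj} \in S_{nl}, \tag{7.1.18}$$

as claimed. The equation

$$\pi^{(n,l)}(A)f \overset{\text{def}}{=} A \cdot f, \quad A \in O(3),\ f \in S_{nl}, \tag{7.1.19}$$

therefore defines a representation $\pi^{(n,l)}$ of $O(3)$ on $S_{nl}$. If $\{\omega_j^{(l)}\}_{j=-l}^{l}$ is the basis of $(H^{(l)})^*$ dual to $\{Y_l^j\}_{j=-l}^{l}$ then the $c_{jm}(A)$ in (7.1.16) are given by $c_{jm}(A) = \omega_j^{(l)}(\pi^{(l)}(A)Y_l^m)$ and are therefore continuous functions of $A$, since the representation $\pi^{(l)}$ is continuous. By (7.1.18) it follows that $\forall\, m$, and $\tau \in S_{nl}^*$ the function

$$A \mapsto \tau(\pi^{(n,l)}(A)\psi_{nlm}) = c_{lm} \sum_{j=-l}^{l} \frac{c_{jm}(A)}{c_{lj}}\, \tau(\psi_{nlj}) \tag{7.1.20}$$

is continuous on $O(3)$. Hence $\pi^{(n,l)}$ is a continuous representation.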

7. Define $\pi : G \to \mathrm{Gl}(V)$ by $\pi(x)v = v$ for $x \in G$, $v \in V$. $\pi$ is called the trivial representation. Of some interest is the trivial one-dimensional representation $\pi : G \to \mathrm{Gl}(\mathbb{C})$.

The character of a finite-dimensional group representation $\pi : G \to \mathrm{Gl}(V)$ is the function $\chi_\pi : G \to F$ = the base field of $V$ given by $\chi_\pi(g) \overset{\text{def}}{=} \operatorname{trace} \pi(g)$ for $g \in G$. If $G$ is a compact topological group it is known that two finite-dimensional, continuous representations of $G$ with the same character are equivalent; clearly, equivalent representations have the same character. For example, by (7.1.18)

$$\pi^{(n,l)}(A)\psi_{nlm} = \sum_{j=-l}^{l} \pi^{(n,l)}(A)_{jm}\, \psi_{nlj}, \quad\text{where}\quad \pi^{(n,l)}(A)_{jm} = \frac{c_{lm}\, c_{jm}(A)}{c_{lj}}. \tag{7.1.21}$$

That is, for every $A \in O(3)$

$$\chi_{\pi^{(n,l)}}(A) = \sum_{j=-l}^{l} \pi^{(n,l)}(A)_{jj} = \sum_{j=-l}^{l} c_{jj}(A) = \operatorname{trace} \pi^{(l)}(A) = \chi_{\pi^{(l)}}(A)$$

(by (7.1.16)) which shows that the representations $\pi^{(n,l)}$ and $\pi^{(l)}$ are equivalent. In particular, $\pi^{(n,l)}$ is irreducible. In fact, by example 5, the restriction of $\pi^{(n,l)}$ to $SO(3)$ defines an irreducible representation of $SO(3)$.
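That equivalent representations share a character comes down to the invariance of the trace under a change of basis. Here is a small numerical sketch for the matrix form $A \mapsto A$ of the representation of example 3, evaluated on rotations about the $z$-axis; the change of basis `T` is a hypothetical choice made only for illustration.

```python
import math

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def trace(A):
    """Sum of the diagonal entries."""
    return sum(A[i][i] for i in range(len(A)))

def rot_z(theta):
    """Rotation by theta about the z-axis, an element of SO(3)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

# A non-orthogonal change of basis T and its inverse (hypothetical choice).
T = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 4.0]]
T_inv = [[1.0, 0.0, 0.0], [0.0, 0.5, 0.0], [0.0, 0.0, 0.25]]

def chi(theta):
    """Character of A -> A at A = rot_z(theta)."""
    return trace(rot_z(theta))

def chi_equivalent(theta):
    """Character of the equivalent matrix representation A -> T A T^{-1}."""
    return trace(matmul(matmul(T, rot_z(theta)), T_inv))

print(all(abs(chi(t) - chi_equivalent(t)) < 1e-12 for t in (0.3, 1.1, 2.5)))
```

On these rotations the character takes the value $1 + 2\cos\theta$, which depends only on the rotation angle.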

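Note, for example, that the representation $\pi : \mathbb{R} \to \mathrm{Gl}(\mathbb{R}^2)$ in example 1 is not irreducible. Namely for $x, a \in \mathbb{R}$, $\pi(x)(a,0) \overset{\text{def}}{=} (a,0)$. That is, $\{(a,0) \mid a \in \mathbb{R}\} \subset \mathbb{R}^2$ is a subspace which is $\pi(x)$-invariant $\forall\, x \in \mathbb{R}$, and which is neither $\{0\}$ nor $\mathbb{R}^2$.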

It is clear that we could also define a finite-dimensional representation of the group $G$ as a homomorphism $\pi : G \to \mathrm{Gl}(n, F)$ = the group of non-singular $n \times n$ matrices over the field $F$: $\pi(x_1x_2) = \pi(x_1)\pi(x_2)$ for $x_1, x_2 \in G$. If $G$ has a topology and $F = \mathbb{R}$ or $\mathbb{C}$ (or if $F$ is a topological field), then continuity of $\pi$ could be taken to mean that for $1 \le i, j \le n$ the functions $g \mapsto \pi(g)_{ij} : G \to F$ are continuous where $\pi(g) = [\pi(g)_{ij}]$. $n$ is the degree or dimension of $\pi$. In example 1, for example, we obtain the "matrix version" of the representation $\pi : \mathbb{R} \to \mathrm{Gl}(\mathbb{R}^2)$ via the homomorphism $\pi : \mathbb{R} \to \mathrm{Gl}(2, \mathbb{R})$ given by $\pi(x) = \begin{bmatrix} 1 & x \\ 0 & 1 \end{bmatrix}$, $x \in \mathbb{R}$, which is simply the matrix of $\pi(x) \in \mathrm{Gl}(\mathbb{R}^2)$ relative to the standard ordered basis $\{(1,0), (0,1)\}$ of $\mathbb{R}^2$.

We will also have occasion to consider infinite-dimensional representations. Say, for example, $G$ has a locally compact Hausdorff topology and $\pi : G \to U(H)$ is a homomorphism to the group of unitary operators $U(H)$ on a Hilbert space $H$. Since every continuous (i.e., bounded) linear functional $f : H \to \mathbb{C}$ on $H$ has the form $f(\alpha) = \langle \alpha, \beta\rangle$ for some unique $\beta \in H$, $\langle\ ,\ \rangle$ being the inner product structure on $H$, we can (with definition (7.1.2) in mind) take continuity of $\pi$ in this case to mean that the functions $x \mapsto \langle \pi(x)v, \beta\rangle$ on $G$ are continuous $\forall\, v, \beta \in H$. Here irreducibility of $\pi$ would mean that if $H_1 \subset H$ is any closed subspace such that $\pi(x)H_1 \subset H_1$ $\forall\, x \in G$, then $H_1$ = either $\{0\}$ or $H$. $\pi$ is called a unitary representation since each operator $\pi(x) : H \to H$, $x \in G$, is unitary: $\langle \pi(x)v_1, \pi(x)v_2\rangle = \langle v_1, v_2\rangle$ for $v_1, v_2 \in H$.

7.2 Contragredient and Tensor Product 
Representations 

Given representations $\pi, \pi_1, \pi_2$ of a group $G$ we can form other representations $\pi_1 \oplus \pi_2$, $\pi_1 \otimes \pi_2$, $\pi^*$, etc. of $G$. We review these constructions as they are needed for later applications. Assume $G$ is a topological group and that we are dealing with finite-dimensional, continuous representations on complex vector spaces, for specificity. If $\pi_j : G \to \mathrm{Gl}(V_j)$, $j = 1, 2$, are two such representations we can form the external direct sum $\pi_1 \oplus \pi_2$ which is a representation of $G$ on $V_1 \times V_2$:

$$(\pi_1 \oplus \pi_2)(x)(v_1, v_2) \overset{\text{def}}{=} (\pi_1(x)v_1, \pi_2(x)v_2) \tag{7.2.1}$$

for $x \in G$, $(v_1, v_2) \in V_1 \times V_2$. Let $i_1 : V_1 \to V_1 \times V_2$, $i_2 : V_2 \to V_1 \times V_2$ be the linear injections $v_1 \mapsto (v_1, 0)$, $v_2 \mapsto (0, v_2)$. If $F \in (V_1 \times V_2)^*$, then $F \circ i_j \in V_j^*$ $\Longrightarrow$ $x \mapsto F \circ i_j(\pi_j(x)v_j) : G \to \mathbb{C}$ is continuous, since $\pi_j$ is continuous $\Longrightarrow$ $x \mapsto F((\pi_1 \oplus \pi_2)(x)(v_1, v_2)) = (F \circ i_1)(\pi_1(x)v_1) + (F \circ i_2)(\pi_2(x)v_2)$ is continuous. That is, $\pi_1 \oplus \pi_2$ is a continuous representation.

Given the representation $\pi : G \to \mathrm{Gl}(V)$ there is a naturally associated representation $\pi^* : G \to \mathrm{Gl}(V^*)$ of $G$ on $V^*$ defined by

$$(\pi^*(x)f)(v) = f(\pi(x^{-1})v) \ \text{ for } (v, f) \in V \times V^*,\ x \in G. \tag{7.2.2}$$

Thus $\pi^*(x)$ is the transpose of $\pi(x^{-1})$. $\pi^*$ is called the representation dual or contragredient to $\pi$. To see that $\pi^*$ is continuous we use the duality $(V^*)^* \simeq V$; i.e., given $\psi \in (V^*)^*$ there is a unique $v \in V$ such that $\psi(f) = f(v)$ $\forall\, f \in V^*$. Then the function $\pi^*_{f,\psi} : G \to \mathbb{C}$ in (7.1.2) is given by $x \mapsto \psi(\pi^*(x)f) = (\pi^*(x)f)(v) \overset{\text{def}}{=} f(\pi(x^{-1})v)$ which is continuous since $\pi$ is continuous and the map $x \mapsto x^{-1}$ of $G$ is continuous, $G$ being a topological group.
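In matrix terms, (7.2.2) says that the contragredient is given by the transpose of the matrix of $\pi(x^{-1})$. The following sketch illustrates this for the representation of example 1, assuming its matrix form $\pi(x) = \begin{bmatrix} 1 & x \\ 0 & 1\end{bmatrix}$ (so that $x^{-1}$ is $-x$ in the additive group $\mathbb{R}$); the helper names are hypothetical.

```python
def M(x):
    """Matrix of pi(x) from example 1 relative to the standard basis."""
    return [[1, x], [0, 1]]

def transpose(A):
    return [list(row) for row in zip(*A)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def apply(A, v):
    """Matrix A applied to the column vector v."""
    return tuple(sum(A[i][k] * v[k] for k in range(len(v))) for i in range(len(A)))

def dual(x):
    """Matrix of pi^*(x) in the dual basis: the transpose of pi(x^{-1}) = pi(-x)."""
    return transpose(M(-x))

def pair(f, v):
    """Evaluate the functional with coefficients f on the vector v."""
    return sum(fi * vi for fi, vi in zip(f, v))

x1, x2 = 3, -5
is_homomorphism = matmul(dual(x1), dual(x2)) == dual(x1 + x2)

f, v = (2, 7), (4, -1)
# (pi^*(x) f)(pi(x) v) = f(v): the pairing between V^* and V is preserved.
pairing_preserved = pair(apply(dual(x1), f), apply(M(x1), v)) == pair(f, v)
print(is_homomorphism, pairing_preserved)
```

Taking the inverse before transposing is what keeps $\pi^*$ a homomorphism rather than an anti-homomorphism; for a commutative group such as $\mathbb{R}$ the distinction is invisible, but the construction is the same in general.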

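Next we construct the tensor product $\pi_1 \otimes \pi_2$ of $\pi_1$ and $\pi_2$. Let $(T, b)$ be the tensor product over $\mathbb{C}$ of $V_1$ with $V_2$. Here we take $V_1$, $V_2$ to be arbitrary (even infinite-dimensional) vector spaces over $\mathbb{C}$. That is, $T$ is a $\mathbb{C}$-vector space and $b : V_1 \times V_2 \to T$ is a bilinear map, where the pair $(T, b)$ satisfies the following universal condition: Given any bilinear map $b_1 : V_1 \times V_2 \to W$ = any $\mathbb{C}$-vector space, $\exists$ a unique linear map $\tilde{b}_1 : T \to W$ such that the diagram

$$\begin{array}{ccc} V_1 \times V_2 & \xrightarrow{\ b\ } & T \\ & {\scriptstyle b_1}\searrow & \downarrow{\scriptstyle \tilde{b}_1} \\ & & W \end{array} \tag{7.2.3}$$

is commutative. $(T, b)$ exists and is unique up to a natural equivalence. One writes $T = V_1 \otimes_{\mathbb{C}} V_2 = V_1 \otimes V_2$, $b(v_1, v_2) = v_1 \otimes_{\mathbb{C}} v_2 = v_1 \otimes v_2$, for $(v_1, v_2) \in V_1 \times V_2$, as usual. If $V_3$, $V_4$ are also $\mathbb{C}$-vector spaces and $T : V_1 \to V_3$, $S : V_2 \to V_4$ are linear maps, then the map $b_{T,S} : V_1 \times V_2 \to V_3 \otimes V_4$ given by $b_{T,S}(v_1, v_2) \overset{\text{def}}{=} Tv_1 \otimes Sv_2$ is bilinear, since $b$ is bilinear. Hence $\exists$ a unique linear map $\tilde{b}_{T,S} : V_1 \otimes V_2 \to V_3 \otimes V_4$ such that

$$\tilde{b}_{T,S}(v_1 \otimes v_2) = Tv_1 \otimes Sv_2 \tag{7.2.4}$$

for $(v_1, v_2) \in V_1 \times V_2$ by (7.2.3). Define $T \otimes S \overset{\text{def}}{=} \tilde{b}_{T,S}$. Thus $T \otimes S : V_1 \otimes V_2 \to V_3 \otimes V_4$ is the unique $\mathbb{C}$-linear map such that $(T \otimes S)(v_1 \otimes v_2) = Tv_1 \otimes Sv_2$ for $(v_1, v_2) \in V_1 \times V_2$, by (7.2.4).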

Now for the finite-dimensional representations $\pi_j : G \to \mathrm{Gl}(V_j)$, $j = 1, 2$, for each $x \in G$, $\pi_1(x) : V_1 \to V_1$, $\pi_2(x) : V_2 \to V_2$ are linear maps; in fact they are automorphisms. By the preceding discussion, there is a unique linear map $\pi_1(x) \otimes \pi_2(x) : V_1 \otimes V_2 \to V_1 \otimes V_2$ such that

$$(\pi_1(x) \otimes \pi_2(x))(v_1 \otimes v_2) = \pi_1(x)v_1 \otimes \pi_2(x)v_2 \tag{7.2.5}$$

for $(v_1, v_2) \in V_1 \times V_2$. We note that $\pi_1(x) \otimes \pi_2(x)$ is surjective [and thus is automatically one-to-one by dimension]. Namely any $t \in V_1 \otimes V_2$ is a finite sum of terms $v_1 \otimes v_2$ with $(v_1, v_2) \in V_1 \times V_2$. Since $\pi_1(x)$, $\pi_2(x)$ are surjective we can write $v_j = \pi_j(x)v_j'$ for suitable $v_j' \in V_j$. Then $v_1 \otimes v_2 = \pi_1(x)v_1' \otimes \pi_2(x)v_2' = (\pi_1(x) \otimes \pi_2(x))(v_1' \otimes v_2')$, by (7.2.5), $\Longrightarrow$ $\pi_1(x) \otimes \pi_2(x)$ is indeed surjective. That is, we have a map $\pi_1 \otimes \pi_2 : G \to \mathrm{Gl}(V_1 \otimes V_2)$ defined by

$$(\pi_1 \otimes \pi_2)(x) = \pi_1(x) \otimes \pi_2(x) \ \text{ for } x \in G, \tag{7.2.6}$$

which indeed is a representation:

$$\begin{aligned}
[(\pi_1 \otimes \pi_2)(x_1x_2)](v_1 \otimes v_2)
&\overset{\text{def}}{=} [\pi_1(x_1x_2) \otimes \pi_2(x_1x_2)](v_1 \otimes v_2) \\
&\overset{\text{def}}{=} \pi_1(x_1x_2)v_1 \otimes \pi_2(x_1x_2)v_2 = \pi_1(x_1)\pi_1(x_2)v_1 \otimes \pi_2(x_1)\pi_2(x_2)v_2 \\
&\overset{\text{def}}{=} (\pi_1(x_1) \otimes \pi_2(x_1))[\pi_1(x_2)v_1 \otimes \pi_2(x_2)v_2] \\
&= [\pi_1(x_1) \otimes \pi_2(x_1)] \circ [\pi_1(x_2) \otimes \pi_2(x_2)](v_1 \otimes v_2) \\
&\overset{\text{def}}{=} [(\pi_1 \otimes \pi_2)(x_1)] \circ [(\pi_1 \otimes \pi_2)(x_2)](v_1 \otimes v_2) \\
\Longrightarrow\ [(\pi_1 \otimes \pi_2)(x_1x_2)]t
&= [(\pi_1 \otimes \pi_2)(x_1)] \circ [(\pi_1 \otimes \pi_2)(x_2)]t \quad \forall\, t \in V_1 \otimes V_2.
\end{aligned}$$
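In coordinates, the operator $\pi_1(x) \otimes \pi_2(x)$ of (7.2.5) is represented by the Kronecker product of the two matrices, relative to the basis of simple tensors in lexicographic order. The sketch below checks the homomorphism property just derived, taking both $\pi_1$ and $\pi_2$ to be the matrix form of example 1 purely as an illustrative assumption; the helper names are hypothetical.

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]

def kron(A, B):
    """Kronecker product: the matrix of A (x) B in the basis e_i (x) e_j."""
    return [[a * b for a in row_a for b in row_b] for row_a in A for row_b in B]

def M(x):
    """Matrix of pi(x) from example 1."""
    return [[1, x], [0, 1]]

def tensor_rep(x):
    """Matrix of (pi (x) pi)(x) = pi(x) (x) pi(x), as in (7.2.6)."""
    return kron(M(x), M(x))

x1, x2 = 3, -5
lhs = tensor_rep(x1 + x2)
rhs = matmul(tensor_rep(x1), tensor_rep(x2))
print(lhs == rhs)
```

The equality rests on the mixed-product rule $(A \otimes B)(C \otimes D) = AC \otimes BD$, which is the matrix form of the chain of equalities displayed above.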



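Finally, we check that $\pi_1 \otimes \pi_2$ is continuous. Choose dual bases $\{e_i^{(j)}\}_i$, $\{e_i^{(j)*}\}_i$ of $V_j$, $V_j^*$, $j = 1, 2$. For $\omega \in (V_1 \otimes V_2)^*$, $t \in V_1 \otimes V_2$ we use again that $t$ is a finite sum of terms $v_1 \otimes v_2$ with $(v_1, v_2) \in V_1 \times V_2$. Thus to see that $x \mapsto \omega((\pi_1 \otimes \pi_2)(x)t)$ is continuous on $G$ it suffices to see that $x \mapsto \omega((\pi_1 \otimes \pi_2)(x)(v_1 \otimes v_2))$ is continuous on $G$. Write, for $x \in G$,

$$v_1 = \sum_m e_m^{(1)*}(v_1)\, e_m^{(1)},\quad v_2 = \sum_l e_l^{(2)*}(v_2)\, e_l^{(2)},\quad \pi_1(x)e_m^{(1)} = \sum_p e_p^{(1)*}(\pi_1(x)e_m^{(1)})\, e_p^{(1)},\quad \pi_2(x)e_l^{(2)} = \sum_q e_q^{(2)*}(\pi_2(x)e_l^{(2)})\, e_q^{(2)}. \tag{7.2.8}$$

Then

$$\begin{aligned}
\omega((\pi_1 \otimes \pi_2)(x)(v_1 \otimes v_2))
&= \omega(\pi_1(x)v_1 \otimes \pi_2(x)v_2) \\
&= \omega\Big(\sum_m e_m^{(1)*}(v_1)\, \pi_1(x)e_m^{(1)} \otimes \sum_l e_l^{(2)*}(v_2)\, \pi_2(x)e_l^{(2)}\Big) \\
&= \sum_{m,l}\sum_{p,q} e_m^{(1)*}(v_1)\, e_l^{(2)*}(v_2)\, e_p^{(1)*}(\pi_1(x)e_m^{(1)})\, e_q^{(2)*}(\pi_2(x)e_l^{(2)})\, \omega(e_p^{(1)} \otimes e_q^{(2)}),
\end{aligned} \tag{7.2.9}$$

which is a continuous function of $x$ as $\pi_1$, $\pi_2$ are continuous.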

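Since we have certain notation in place and equation (7.2.9) we establish a simple linear algebraic formula for later use. Let $\epsilon : \mathbb{C} \otimes \mathbb{C} \to \mathbb{C}$ be the canonical vector space isomorphism given by $\epsilon(z_1 \otimes z_2) \overset{\text{def}}{=} z_1z_2$ for $z_1, z_2 \in \mathbb{C}$. Then $\omega_{ij} \overset{\text{def}}{=} \epsilon \circ (e_i^{(1)*} \otimes e_j^{(2)*}) \in (V_1 \otimes V_2)^*$ satisfies $\omega_{ij}(e_r^{(1)} \otimes e_s^{(2)}) = \epsilon(\delta_{i,r} \otimes \delta_{j,s}) = \delta_{(i,j),(r,s)}$. That is, $\{\omega_{ij}\}$ is the basis of $(V_1 \otimes V_2)^*$ dual to the basis $\{e_r^{(1)} \otimes e_s^{(2)}\}$ of $V_1 \otimes V_2$. By (7.2.9), for any $x \in G$

$$\begin{aligned}
\omega_{ij}((\pi_1 \otimes \pi_2)(x)(e_r^{(1)} \otimes e_s^{(2)}))
&= \sum_{m,l,p,q} \delta_{m,r}\, \delta_{l,s}\, e_p^{(1)*}(\pi_1(x)e_m^{(1)})\, e_q^{(2)*}(\pi_2(x)e_l^{(2)})\, \delta_{(i,j),(p,q)} \\
&= \sum_{p,q} e_p^{(1)*}(\pi_1(x)e_r^{(1)})\, e_q^{(2)*}(\pi_2(x)e_s^{(2)})\, \delta_{(i,j),(p,q)} \\
&= e_i^{(1)*}(\pi_1(x)e_r^{(1)})\, e_j^{(2)*}(\pi_2(x)e_s^{(2)}),
\end{aligned} \tag{7.2.10}$$

which is the formula we desire.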

7.3 Subrepresentations 

If $\pi : G \to \mathrm{Gl}(V)$ is a representation of $G$ and $V_1 \subset V$ is a $\pi$-invariant subspace (i.e., $\pi(x)V_1 \subset V_1$ $\forall\, x \in G$) then we can construct a representation $\pi_1 : G \to \mathrm{Gl}(V_1)$ of $G$ on $V_1$ by setting

$$\pi_1(x) = \pi(x)|_{V_1} \tag{7.3.1}$$

for $x \in G$. $\pi_1$ is called a subrepresentation of $\pi$. In example 5, for example, the representation $\pi^{(l)}$ of $O(3)$ on $H^{(l)}$ is a subrepresentation of the representation $\pi^{(l)}$ of $O(3)$ on $P^{(l)}$. Note that $\pi_1$ is continuous, assuming that $\pi$ is continuous, since any $f_1 \in V_1^*$ is the restriction to $V_1$ of some $f \in V^*$. If $V$ were an infinite-dimensional Hilbert space, for example, we would take $V_1$ to be a closed subspace and $f_1 : V_1 \to \mathbb{C}$ to be a bounded linear functional. Then we could choose a bounded linear functional $f$ on $V$ such that $f|_{V_1} = f_1$ by the Hahn-Banach theorem.

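More generally, a representation $\pi_1 : G \to \mathrm{Gl}(V_1)$ is said to be contained in $\pi : G \to \mathrm{Gl}(V)$ if $\pi_1$ is equivalent to some subrepresentation of $\pi$: For some $\pi$-invariant subspace $V_0 \subset V$ the diagram

$$\begin{array}{ccc} V_1 & \xrightarrow{\ \pi_1(x)\ } & V_1 \\ {\scriptstyle T}\downarrow & & \downarrow{\scriptstyle T} \\ V_0 & \xrightarrow{\ \pi(x)|_{V_0}\ } & V_0 \end{array} \tag{7.3.2}$$

is commutative, where $T$ is a suitable linear isomorphism. Note that the trivial one-dimensional representation $\pi_1 : G \to \mathrm{Gl}(\mathbb{C})$ of example 7 is contained in $\pi$ $\iff$ $\exists$ a one-dimensional subspace $V_0 \subset V$ such that

$$\pi(x)v_0 = v_0 \quad \forall\, v_0 \in V_0,\ x \in G. \tag{7.3.3}$$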







By example 6, the representation $\pi^{(n,l)}$ of $O(3)$ on $S_{nl}$ is contained in the representation $\pi^{(l)}$ of $O(3)$ on $P^{(l)}$. As we have seen $\pi$ is irreducible if $\{0\}$ and $V$ are the only $\pi$-invariant subspaces of $V$. Here we assume $V \ne \{0\}$. Thus irreducibility of $\pi$ means that $\pi$ has no non-trivial subrepresentations.

Suppose we have $V = V_1 \oplus \cdots \oplus V_m$ as a direct sum of $m$ subspaces $V_i$ with each $V_i$ $\pi$-invariant. Then we have $m$ subrepresentations $\pi_i = \pi|_{V_i}$ of $\pi$ and we say that $\pi$ is the internal direct sum of the $\pi_i$. Using the same notation for the external direct sum (see Section 7.2) we write $\pi = \pi_1 \oplus \cdots \oplus \pi_m$. By example 5 of Section 7.1 for each integer $l \ge 0$ there is exactly one continuous, irreducible representation $\pi^{(l)}$ of $SO(3)$ of dimension $2l + 1$, up to equivalence. A natural question arises: What is the structure of the tensor product $\pi^{(l_1)} \otimes \pi^{(l_2)}$ for two integers $l_1, l_2 \ge 0$? The answer is given by

Theorem 7.1 (The Clebsch-Gordan Formula). Up to equivalence

$$\pi^{(l_1)} \otimes \pi^{(l_2)} = \pi^{(|l_1 - l_2|)} \oplus \pi^{(|l_1 - l_2| + 1)} \oplus \cdots \oplus \pi^{(l_1 + l_2)}. \tag{7.3.4}$$

Formula (7.3.4) exhibits all the possible irreducible representations of $SO(3)$ contained in $\pi^{(l_1)} \otimes \pi^{(l_2)}$. For example $\pi^{(1)} \otimes \pi^{(1)} = \pi^{(0)} \oplus \pi^{(1)} \oplus \pi^{(2)}$.
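A quick consistency check on the Clebsch-Gordan formula is that dimensions match on both sides. The sketch below assumes the standard form of the decomposition, $\pi^{(l_1)} \otimes \pi^{(l_2)} \simeq \bigoplus_{l = |l_1 - l_2|}^{l_1 + l_2} \pi^{(l)}$ with $\dim \pi^{(l)} = 2l + 1$; the function names are hypothetical.

```python
def clebsch_gordan_labels(l1, l2):
    """Labels l of the irreducible summands of pi^(l1) (x) pi^(l2)."""
    return list(range(abs(l1 - l2), l1 + l2 + 1))

def dim(l):
    """Dimension of the irreducible representation pi^(l) of SO(3)."""
    return 2 * l + 1

def dimensions_match(l1, l2):
    """Compare dim(pi^(l1) (x) pi^(l2)) with the sum of the summand dimensions."""
    return dim(l1) * dim(l2) == sum(dim(l) for l in clebsch_gordan_labels(l1, l2))

print(clebsch_gordan_labels(1, 1), dimensions_match(1, 1))
print(all(dimensions_match(a, b) for a in range(8) for b in range(8)))
```

Matching dimensions is only a necessary condition, not a proof of the formula, but it is a useful guard against index slips when the decomposition is applied to selection rules.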

7.4 Group Actions 

$G$ is again assumed to be a topological group. $G$ is said to act on a topological space $X$ on the left if $\forall\, g \in G$ there is a homeomorphism $h_g : X \to X$ of $X$ such that the following conditions hold. Set $g \cdot x \overset{\text{def}}{=} h_g(x)$ for $x \in X$, $g \in G$. We require that $(g_1g_2) \cdot x = g_1 \cdot (g_2 \cdot x)$ [or $(g_1g_2) \cdot x = g_2 \cdot (g_1 \cdot x)$ for a right action], $1 \cdot x = x$ for $1$ the identity element of $G$, $g_1, g_2 \in G$, $x \in X$. We also require that the map $G \times X \to X$ given by $(g, x) \mapsto h_g(x) = g \cdot x$ be continuous. Given a left action, note that the map

$$(g, x) \mapsto g^{-1} \cdot x$$

defines a right action.

Some examples of left group actions are the following.

1. Let $G$ act on $X = G$ by left multiplication: $h_g(g_1) \overset{\text{def}}{=} gg_1$ for $g, g_1 \in G$. By definition of a topological group, the above map $G \times X \to X$ (i.e., $(g, g_1) \mapsto h_g(g_1) = gg_1$) is continuous.

2. More generally, for a closed subgroup $H$ of $G$ let $X$ be the coset space $G/H$ with the quotient topology. For $g \in G$, define $h_g : X \to X$ by $h_g(g_1H) = (gg_1)H$ for $g, g_1 \in G$.

3. Let $\pi : G \to \mathrm{Gl}(V)$ be a continuous representation of $G$ on $V$ where $V$ is finite-dimensional over $F = \mathbb{R}$ or $\mathbb{C}$. Let $\tau$ be the weak topology on $V$ generated by the $f \in V^*$. This is the same as the topology on $V$ obtained by a metric on $V$ induced by an isomorphism $V \simeq F^n$ for some choice of basis of $V$. Let $X = (V, \tau)$ and define $h_g : V \to V$ by $h_g(v) = \pi(g)v$ for $g \in G$, $v \in V$. Such actions, coming by way of a representation of $G$, are called linear actions. From example 3 in Section 7.1 we obtain, for example, the linear action of $O(n)$ on $\mathbb{R}^n$ or on $\mathbb{C}^n$ by setting

$$g \cdot x \overset{\text{def}}{=} h_g(x) = xg^t$$

for $g \in O(n)$, $x \in \mathbb{R}^n$, $\mathbb{C}^n$ a row vector.

4. Define $h_g : G \to G$ by $h_g(x) = gxg^{-1}$ for $g, x \in G$. Thus $h_g$ is the inner automorphism of $G$ induced by $g$, and one has indeed that $(g_1g_2) \cdot x = g_1 \cdot (g_2 \cdot x)$ for $g_1, g_2, x \in G$.
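The left-action axiom for conjugation (example 4) and the passage from a left action to a right action via $g^{-1}$ can be checked exhaustively on a small finite group. The sketch below uses the symmetric group on three letters with the discrete topology as an illustrative assumption; the helper names are hypothetical.

```python
from itertools import permutations

def mul(g, h):
    """Composition of permutations: (g h)(k) = g(h(k))."""
    return tuple(g[h[k]] for k in range(len(g)))

def inv(g):
    """Inverse permutation."""
    out = [0] * len(g)
    for k, gk in enumerate(g):
        out[gk] = k
    return tuple(out)

def conj_action(g, x):
    """Left action of example 4: g . x = g x g^{-1}."""
    return mul(mul(g, x), inv(g))

def right_from_left(x, g):
    """The associated right action: x . g = g^{-1} . x."""
    return conj_action(inv(g), x)

G = list(permutations(range(3)))
identity = tuple(range(3))

left_ok = all(conj_action(mul(g1, g2), x) == conj_action(g1, conj_action(g2, x))
              for g1 in G for g2 in G for x in G)
right_ok = all(right_from_left(x, mul(g1, g2)) == right_from_left(right_from_left(x, g1), g2)
               for g1 in G for g2 in G for x in G)
identity_ok = all(conj_action(identity, x) == x for x in G)
print(left_ok, right_ok, identity_ok)
```

The right-action check is the statement $(g_1g_2) \cdot x = g_2 \cdot (g_1 \cdot x)$ from the text, written with the group element on the right.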

7.5 An Abstract Selection Rule 

As indicated in introductory remarks to this chapter, our goal is to formulate in 
broad, abstract terms a “selection rule” — a group theoretic result by which we 
may determine that certain integrals vanish (of course without actually computing 
these integrals). Such a result seems to be in use in physics and chemistry implic- 
itly where often enough no clear general statement, or careful set of hypotheses, 
is set forth. Toward achieving our goal we consider the following context, and set 
of hypotheses. 

As before $G$ is a fixed topological group which acts on a topological space $X$ on the left. We assume now that $G$ and $X$ are locally compact (with Hausdorff topologies). Eventually we will take $G$ to be compact. We also assume that on $X$ there is some $G$-invariant measure $dx$: For $C_c(X)$ = the space of continuous, compactly supported complex-valued functions on $X$

$$\int_X (g \cdot f)(x)\,dx = \int_X f(x)\,dx, \quad\text{where} \tag{7.5.1}$$

$$(g \cdot f)(x) \overset{\text{def}}{=} f(g^{-1} \cdot x) \tag{7.5.2}$$

for $(g, f) \in G \times C_c(X)$. By "measure" we mean that $dx$ is a Radon measure; one could also take $dx$ to be a regular Baire, or Borel measure. The $G$-invariance of $dx$ in (7.5.1) means that the equation

$$(\pi_L(g)f)(x) \overset{\text{def}}{=} f(g^{-1} \cdot x) = (g \cdot f)(x) \tag{7.5.3}$$

for $(g, x, f) \in G \times X \times L^2(X, dx)$ defines a unitary representation $\pi = \pi_L$ of $G$ on the Hilbert space $L^2(X, dx)$ of square-integrable complex-valued functions (with respect to $dx$) on $X$:

$$\|\pi_L(g)f\|_2^2 = \int_X |(g \cdot f)(x)|^2\,dx = \int_X |f(x)|^2\,dx = \|f\|_2^2. \tag{7.5.4}$$
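A finite model makes (7.5.1) through (7.5.4) concrete. In the sketch below, which is an illustrative assumption and not an example from the text, $X = G = \mathbb{Z}/n$ with $G$ acting by translation and $dx$ the counting measure, so that integrals are finite sums; a function on $X$ is stored as a list of its values.

```python
def act(g, f):
    """(g . f)(x) = f(g^{-1} . x), where g^{-1} . x = x - g (mod n)."""
    n = len(f)
    return [f[(x - g) % n] for x in range(n)]

def integral(f):
    """Integral against counting measure on Z/n."""
    return sum(f)

def norm_sq(f):
    """Squared L^2 norm; exact here because the sample values are Gaussian integers."""
    return sum(z.real ** 2 + z.imag ** 2 for z in f)

f = [1 + 2j, -3 + 0j, 4j, 2 - 1j, 0j]
shifted = [act(g, f) for g in range(len(f))]

invariant = all(integral(h) == integral(f) for h in shifted)      # as in (7.5.1)
unitary = all(norm_sq(h) == norm_sq(f) for h in shifted)          # as in (7.5.4)
homomorphism = act(2, act(4, f)) == act((2 + 4) % 5, f)           # pi_L(g1) pi_L(g2) = pi_L(g1 g2)
print(invariant, unitary, homomorphism)
```

The inverse in (7.5.2) is what makes $g \mapsto \pi_L(g)$ a homomorphism; in this commutative model either convention would work, but for a non-commutative $G$ only the version with $g^{-1}$ does.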



Proposition 7.1. The representation $\pi_L$ of $G$ on $L^2(X, dx)$ is continuous.

First we prove a standard lemma.

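Lemma 7.1. Let $f \in C_c(X)$, $g_0 \in G$. Then for $\epsilon > 0$ $\exists$ a neighborhood $U_0$ of $g_0$ in $G$ (i.e., $U_0$ is an open set in $G$ containing $g_0$) such that

$$g \in U_0 \ \Longrightarrow\ |f(g^{-1} \cdot x) - f(g_0^{-1} \cdot x)| < \epsilon \quad \forall\, x \in X.$$

Proof. As $f$ has compact support let $f(X - K) = 0$ for $K \subset X$ compact. Let $U$ be a neighborhood of $g_0$ in $G$ such that its closure $\bar{U}$ is compact (since $G$ is locally compact). Then $\bar{U} \cdot K \subset X$ is compact. Take $x \in \bar{U} \cdot K$. As $y \mapsto f(g_0^{-1} \cdot y)$ is continuous on $X$, $\exists$ a neighborhood $A_x$ of $x$ in $X$ such that $y \in A_x$ $\Longrightarrow$ $|f(g_0^{-1} \cdot y) - f(g_0^{-1} \cdot x)| < \frac{\epsilon}{2}$, given $\epsilon > 0$. $(g, y) \mapsto f(g^{-1} \cdot y)$ is continuous on $G \times X$ so that, similarly, $\exists$ neighborhoods $U_x$, $V_x$ of $g_0$, $x$ in $G$, $X$ such that $(g, y) \in U_x \times V_x$ $\Longrightarrow$ $|f(g^{-1} \cdot y) - f(g_0^{-1} \cdot x)| < \frac{\epsilon}{2}$. Then $\{A_x \cap V_x\}_{x \in \bar{U} \cdot K}$ is an open covering of $\bar{U} \cdot K$ in $X$. The compactness of $\bar{U} \cdot K$ guarantees the existence of some finite open subcovering: $\bar{U} \cdot K \subset \bigcup_{j=1}^n A_{x_j} \cap V_{x_j}$ for suitable $x_1, x_2, \ldots, x_n \in \bar{U} \cdot K$. Define $U_0 = U \cap \big(\bigcap_{j=1}^n U_{x_j}\big)$ which is open in $G$ and which contains $g_0$. Fix $g \in U_0$ and let $x \in X$ be arbitrary. Consider two cases: (i) $x \in \bar{U} \cdot K$, (ii) $x \notin \bar{U} \cdot K$.

In case (i), $x \in A_{x_j} \cap V_{x_j}$ for some $x_j$. We have $g \in U_{x_j}$, $x \in V_{x_j}$ $\Longrightarrow$ $|f(g^{-1} \cdot x) - f(g_0^{-1} \cdot x_j)| < \frac{\epsilon}{2}$, and $x \in A_{x_j}$ $\Longrightarrow$ $|f(g_0^{-1} \cdot x) - f(g_0^{-1} \cdot x_j)| < \frac{\epsilon}{2}$ $\Longrightarrow$ $|f(g^{-1} \cdot x) - f(g_0^{-1} \cdot x)| \le |f(g^{-1} \cdot x) - f(g_0^{-1} \cdot x_j)| + |f(g_0^{-1} \cdot x_j) - f(g_0^{-1} \cdot x)| < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon$. On the other hand in case (ii) we have $g^{-1} \cdot x \notin K$; otherwise $x \in g \cdot K \subset U \cdot K \subset \bar{U} \cdot K$ is a contradiction. Similarly $g_0 \in U$ $\Longrightarrow$ $g_0^{-1} \cdot x \notin K$ so that $f(X - K) = 0$ $\Longrightarrow$ $|f(g^{-1} \cdot x) - f(g_0^{-1} \cdot x)| = 0 < \epsilon$, which proves the lemma.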

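To prove Proposition 7.1 we first show that $g \mapsto \langle \pi_L(g)f_1, f_2\rangle$ is continuous on $G$ for $f_1 \in C_c(X)$, $f_2 \in L^2(X, dx)$ fixed: For $g_0 \in G$

$$\begin{aligned}
|\langle \pi_L(g)f_1, f_2\rangle - \langle \pi_L(g_0)f_1, f_2\rangle|^2
&= |\langle \pi_L(g)f_1 - \pi_L(g_0)f_1, f_2\rangle|^2 \\
&\le \|f_2\|_2^2\, \|\pi_L(g)f_1 - \pi_L(g_0)f_1\|_2^2 \\
&\qquad\text{(which is the Cauchy-Bunyakovskii-Schwarz inequality)} \\
&\overset{\text{def}}{=} \|f_2\|_2^2 \int_X |f_1(g^{-1} \cdot x) - f_1(g_0^{-1} \cdot x)|^2\,dx \\
&= \|f_2\|_2^2 \int_{\bar{U} \cdot K} |f_1(g^{-1} \cdot x) - f_1(g_0^{-1} \cdot x)|^2\,dx,
\end{aligned} \tag{7.5.5}$$

where $f_1(X - K) = 0$ for $K \subset X$ compact and where $U$ = a neighborhood of $g_0$ in $G$ such that $\bar{U}$ is compact, as in the proof of Lemma 7.1, where we saw in case (ii) there that $x \notin \bar{U} \cdot K$ $\Longrightarrow$ $f_1(g^{-1} \cdot x)$, $f_1(g_0^{-1} \cdot x) = 0$ for $g \in U$. Thus given $\epsilon > 0$ choose $U_0$ containing $g_0$ as in Lemma 7.1, $U_0 \subset U$. In particular, for $g \in U_0$

$$\int_{\bar{U} \cdot K} |f_1(g^{-1} \cdot x) - f_1(g_0^{-1} \cdot x)|^2\,dx \le \epsilon^2 \cdot \text{measure of } \bar{U} \cdot K,$$

where the measure of $\bar{U} \cdot K < \infty$ as $\bar{U} \cdot K$ is compact and $dx$ is regular. Hence $g \mapsto \langle \pi_L(g)f_1, f_2\rangle$ is continuous for $f_1 \in C_c(X)$, $f_2 \in L^2(X, dx)$. Since $C_c(X)$ is dense in $L^2(X, dx)$ (again for $dx$ regular) we have continuity of $g \mapsto \langle \pi_L(g)f_1, f_2\rangle$ for $f_1, f_2 \in L^2(X, dx)$, as desired. $\Box$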

$\pi_L$ is called the left-regular representation of $G$ on $L^2(X, dx)$ since it coincides with the familiar left regular representation of $G$ on $L^2(G, dg)$ in case $X = G$, with the left action being left multiplication, where $dx = dg$ = a left Haar measure on $G$. Since the Haar measure of any non-empty open subset of $G$ is positive we impose similarly the condition that

$$\text{the } dx\text{-measure of any non-empty open subset of } X \text{ is positive.} \tag{7.5.6}$$

From the condition (7.5.6) it follows in particular that if $C_2(X)$ is the space of continuous, complex-valued, square-integrable functions on $X$, then the equation

$$\langle f_1, f_2\rangle = \int_X f_1(x)\overline{f_2(x)}\,dx \tag{7.5.7}$$

defines a complex inner product on $C_2(X)$. Namely if $f \in C_2(X)$ with $\langle f, f\rangle = 0$, then $f(x) = 0$ $dx$-almost everywhere: $N \overset{\text{def}}{=} \{x \in X \mid f(x) \ne 0\} \subset X$ is a subset of $dx$-measure zero. But since $f$ is continuous, $N$ is an open subset of $X$. By (7.5.6) we therefore must have $N = \emptyset$ $\Longrightarrow$ $f = 0$. If $f \in C_2(X)$, then $g \cdot f \in C_2(X)$ $\forall\, g \in G$ by the $G$-invariance of $dx$; $(g \cdot f)(x) \overset{\text{def}}{=} f(g^{-1} \cdot x)$ for $x \in X$, as in (7.5.2).

Thus as in (7.1.14) the equation

$$\pi(g)f \overset{\text{def}}{=} g \cdot f \ \text{ for } (g, f) \in G \times C_2(X) \tag{7.5.8}$$

defines a unitary representation $\pi$ of $G$ on $C_2(X)$ (with respect to the inner product defined in (7.5.7)) with the following continuity property: $g \mapsto \langle \pi(g)f_1, f_2\rangle_{C_2(X)} = \langle \pi_L(g)f_1, f_2\rangle_{L^2(X,dx)}$ is continuous $\forall\, f_1, f_2 \in C_2(X)$, by Proposition 7.1.

Now let $V_1, V_2 \subset C_2(X)$ be finite-dimensional, $G$-invariant complex subspaces: $f \in V_j$, $g \in G$ $\Longrightarrow$ $\pi(g)f = g \cdot f \in V_j$, $j = 1, 2$. Then by restriction of $\pi$ to $V_j$ we obtain finite-dimensional, continuous, unitary subrepresentations $\pi^{(1)}$, $\pi^{(2)}$ of $\pi$, of $G$ on $V_1$, $V_2$. The abstract selection rule will involve the representations $\pi^{(1)}$, $\pi^{(2)}$ and a third representation $\pi^{(3)}$ constructed as follows. Let $\phi$ be a continuous, real or complex-valued function on $X$ such that $V_3 \overset{\text{def}}{=}$ the $\mathbb{C}$-span of the vectors $\{g \cdot \phi \mid g \in G\}$ is finite-dimensional. In practice $\phi$ might not be square-integrable with respect to $dx$. We assume however that $\phi f_1$ is square-integrable $\forall\, f_1 \in V_1$. Note then that for $g \in G$, $f_1 \in V_1$ we have $g^{-1} \cdot f_1 \in V_1$ (as $V_1$ is $G$-invariant) $\Longrightarrow$ $\phi\,(g^{-1} \cdot f_1)$ is square-integrable $\Longrightarrow$ $g \cdot [\phi\,(g^{-1} \cdot f_1)]$ is square-integrable (as $dx$ is $G$-invariant). That is, $(g \cdot \phi)f_1 = g \cdot [\phi\,(g^{-1} \cdot f_1)]$ is square-integrable $\forall\, g \in G$ $\Longrightarrow$ $\psi f_1$ is square-integrable for $(\psi, f_1) \in V_3 \times V_1$:

$$V_3V_1 \subset C_2(X). \tag{7.5.9}$$

The space $V_3$ is clearly $G$-invariant so that we obtain a finite-dimensional representation $\pi^{(3)}$ of $G$ on $V_3$ given by

$$\pi^{(3)}(g)f \overset{\text{def}}{=} g \cdot f \ \text{ for } (g, f) \in G \times V_3. \tag{7.5.10}$$

It will be clear in concrete examples that $\pi^{(3)}$ is continuous. Rather than formulating a general statement about continuity we simply assume, for convenience, that $\pi^{(3)}$ is indeed continuous.

Since $V_2 \subset C_2(X)$ we see by (7.5.9) that $\langle \psi f_1, f_2\rangle$ is well defined (i.e., is finite) for $(\psi, f_1, f_2) \in V_3 \times V_1 \times V_2$. In particular if $\{e_i^{(1)}\}_{i=1}^p$, $\{e_j^{(2)}\}_{j=1}^r$, $\{\phi_j\}_{j=1}^q$ are $\mathbb{C}$-bases of $V_1, V_2, V_3$ respectively, the complex numbers

$$I_{sij} \overset{\text{def}}{=} \langle \phi_s e_i^{(1)}, e_j^{(2)}\rangle = \int_X \phi_s(x)\, e_i^{(1)}(x)\, \overline{e_j^{(2)}(x)}\,dx, \qquad 1 \le s \le q,\ 1 \le i \le p,\ 1 \le j \le r, \tag{7.5.11}$$

are well defined. Assume, finally, that $G$ is actually compact. Thus, to review our assumptions, $G$ is a compact topological group which acts on a locally compact topological space $X$ on the left (where the topologies of $G$ and $X$ are Hausdorff), and $dx$ is a regular Baire, Borel, or Radon measure on $X$ which is $G$-invariant and subject to condition (7.5.6). For $C_2(X)$ the space of continuous, complex-valued, square-integrable functions on $X$ (with respect to the measure $dx$) let $V_1, V_2 \subset C_2(X)$ be finite-dimensional $G$-invariant subspaces (where $(g \cdot f)(x) \overset{\text{def}}{=} f(g^{-1} \cdot x)$ for $g \in G$, for a function $f$ on $X$, and for $x \in X$) and let $\pi^{(1)}$, $\pi^{(2)}$ be the corresponding finite-dimensional continuous, unitary representations (with respect to the inner product on $C_2(X)$ defined in (7.5.7)) of $G$ on $V_1$, $V_2$ given by $\pi^{(j)}(g)f \overset{\text{def}}{=} g \cdot f$ for $g \in G$, $f \in V_j$, $j = 1, 2$, as described above. Let $\phi$ be a continuous function on $X$ such that

1. $\phi$ is $G$-finite, i.e., $V_3 \overset{\text{def}}{=}$ the $\mathbb{C}$-span of the vectors $\{g \cdot \phi \mid g \in G\}$ is finite-dimensional,

2. $\phi f_1$ is square-integrable with respect to $dx$ $\forall\, f_1 \in V_1$,

3. the corresponding representation $\pi^{(3)} : G \to \mathrm{Gl}(V_3)$ of $G$ on $V_3$ given by (7.5.10) is continuous.

As we have noted assumption 2 implies that $V_3V_1 \subset C_2(X)$ so that the inner product $\langle \psi f_1, f_2\rangle$ of $\psi f_1, f_2 \in C_2(X)$ is well defined for $(\psi, f_1, f_2) \in V_3 \times V_1 \times V_2$, and in particular the integrals $I_{sij}$ in (7.5.11) are well defined. With these assumptions (just reviewed) in place one has the following main theorem, where we recall that $\pi^*$ denotes the contragredient representation of a representation $\pi$ of $G$. Also see Theorem 7.6 for an alternate version of the main theorem.

Theorem 7.2 (An Abstract Selection Rule). Suppose the representation $(\pi^{(1)} \otimes \pi^{(3)}) \otimes \pi^{(2)*}$ does not contain the trivial one-dimensional representation of $G$; see example 7 above. Then the integrals $I_{sij}$ in (7.5.11) vanish for all $s, i, j$. If $\pi^{(2)}$ is actually irreducible and is not contained in $\pi^{(1)} \otimes \pi^{(3)}$ we may also conclude (more simply) that all of the integrals $I_{sij}$ vanish.



A proof of Theorem 7.2, based on the classical orthogonality relations for rep- 
resentations of compact groups, is presented in Appendix 7A. Before proceeding 
to some concrete applications of Theorem 7.2 we first present an alternative ex- 
pression for the integrals Isij. 



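Let $\{e_i^{(1)*}\}_{i=1}^{p}$, $\{e_j^{(2)*}\}_{j=1}^{r}$, $\{\phi_s^{*}\}_{s=1}^{q}$ be the $\mathbb{C}$-bases of $V_1^*$, $V_2^*$, $V_3^*$ dual to $\{e_i^{(1)}\}_{i=1}^{p}$, $\{e_j^{(2)}\}_{j=1}^{r}$, $\{\phi_s\}_{s=1}^{q}$ respectively. Then for $g \in G$

\[ \pi^{(1)}(g)\, e_i^{(1)} = \sum_{m=1}^{p} e_m^{(1)*}\big(\pi^{(1)}(g)\, e_i^{(1)}\big)\, e_m^{(1)}, \tag{7.5.12} \]

\[ \pi^{(2)}(g)\, e_j^{(2)} = \sum_{s=1}^{r} e_s^{(2)*}\big(\pi^{(2)}(g)\, e_j^{(2)}\big)\, e_s^{(2)}, \tag{7.5.13} \]

\[ \pi^{(3)}(g)\, \phi_t = \sum_{n=1}^{q} \phi_n^{*}\big(\pi^{(3)}(g)\, \phi_t\big)\, \phi_n. \tag{7.5.14} \]

As $( \, , \, )$ is $G$-invariant (i.e., $dx$ is $G$-invariant) one has $\forall\, g \in G$

\[
\begin{aligned}
I_{tij} \overset{\text{def}}{=} (\phi_t e_i^{(1)}, e_j^{(2)})
&= \big(g \cdot (\phi_t e_i^{(1)}),\ g \cdot e_j^{(2)}\big)
 = \big((\pi^{(3)}(g)\phi_t)(\pi^{(1)}(g) e_i^{(1)}),\ \pi^{(2)}(g) e_j^{(2)}\big) \\
&= \sum_{n,m,s} \phi_n^{*}\big(\pi^{(3)}(g)\phi_t\big)\, e_m^{(1)*}\big(\pi^{(1)}(g) e_i^{(1)}\big)\, \overline{e_s^{(2)*}\big(\pi^{(2)}(g) e_j^{(2)}\big)}\; I_{nms},
\end{aligned}
\tag{7.5.15}
\]

since $(\phi_n e_m^{(1)}, e_s^{(2)}) = I_{nms}$ by definition (7.5.11). Let $\{\omega_{ij}\}_{1 \le i \le p,\ 1 \le j \le q}$ be the basis of $(V_1 \otimes V_3)^*$ dual to the basis $\{e_i^{(1)} \otimes \phi_j\}_{i,j}$ of $V_1 \otimes V_3$. Note that for $g \in G$

\[ \omega_{mn}\big((\pi^{(1)} \otimes \pi^{(3)})(g)(e_i^{(1)} \otimes \phi_t)\big) = e_m^{(1)*}\big(\pi^{(1)}(g) e_i^{(1)}\big)\, \phi_n^{*}\big(\pi^{(3)}(g)\phi_t\big) \tag{7.5.16} \]

by (7.2.10). If $dg$ denotes normalized Haar measure on $G$, then by (7.5.15) and (7.5.16)

\[ I_{tij} = \int_G I_{tij}\, dg = \sum_{n,m,s} I_{nms} \int_G \omega_{mn}\big((\pi^{(1)} \otimes \pi^{(3)})(g)(e_i^{(1)} \otimes \phi_t)\big)\, \overline{e_s^{(2)*}\big(\pi^{(2)}(g) e_j^{(2)}\big)}\; dg, \tag{7.5.17} \]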



where the integrands are continuous as functions on $G$ since the representations $\pi^{(j)}$, $j = 1, 2, 3$, are continuous. Equation (7.5.17) provides for the desired alternative expression of the integral $I_{tij}$ where the integration over $G$, versus the integration over $X$ in the definition (7.5.11), has the advantage that the group theory of $G$ readily applies.



7.6 Some Preliminary Applications of Theorem 7.2 

Fix integers $n, n', l, l'$ with $n, n' \ge 1$, $0 \le l \le n - 1$, $0 \le l' \le n' - 1$. We take $G = SO(3)$ with its left action on $X = \mathbb{R}^3$ given in example 3 of Section 7.4: $g \cdot x \overset{\text{def}}{=} x g^{-1}$ for $x \in \mathbb{R}^3$ a row vector, $g \in SO(3)$. In (7.5.1) we choose $dx = dx\,dy\,dz$ = Lebesgue measure on $\mathbb{R}^3$. If $A$ is any real $3 \times 3$ non-singular matrix one has by the transformation of integrals formula

\[ \int_{\mathbb{R}^3} f(xA)\, dx = \frac{1}{|\det A|} \int_{\mathbb{R}^3} f(x)\, dx \tag{7.6.1} \]

for $f \in L^1(\mathbb{R}^3, dx)$. An analogous result holds for $\mathbb{R}^n$ of course. In particular $dx$ is indeed $G$-invariant; compare definition (7.5.1). Also $dx$ satisfies condition (7.5.6); in fact $dx$ is a Haar measure on $\mathbb{R}^3$. Let $\pi^{(1)}, \pi^{(2)}$ be the continuous, irreducible representations of $G$ on the spaces $V_1 = S_{nl}$, $V_2 = S_{n'l'}$ generated by the hydrogenic wave functions $\{\psi_{nlm}\}_{m=-l}^{l}$, $\{\psi_{n'l'm'}\}_{m'=-l'}^{l'}$ as described in example 6 of Section 7.1. $V_1, V_2 \subset C_2(X)$, the space of continuous, square-integrable functions on $\mathbb{R}^3$, by Proposition 5.2. For $p_j : \mathbb{R}^3 \to \mathbb{R}$ the projection of $(x, y, z)$ to $x$, $y$, $z$ for $j = 1, 2, 3$, respectively, we have seen in example 4 of Section 7.1 that the $\mathbb{C}$-span of the vectors $\{g \cdot p_j \mid g \in G\}$ (also denoted $G \cdot p_j$) = the $\mathbb{C}$-span of the vectors $\{p_1, p_2, p_3\}$ in the space of continuous functions on $\mathbb{R}^3$. Thus, for $\phi = p_j$, $V_3$ is the $\mathbb{C}$-span of $\{p_1, p_2, p_3\}$. Moreover the natural representation $\pi^{(3)}$ of $G$ on $V_3$ is continuous and is equivalent to the natural representation $\pi$ of $G$ on $\mathbb{C}^3$ given in example 3 of Section 7.1. We have seen in example 5 that $\pi$ is in fact irreducible, so that it is equivalent to the three-dimensional irreducible representation of $G$ (the case $l = 1$). Each $p_j \psi_{nlm}$ is square-integrable. Thus all the hypotheses leading up to the statement of Theorem 7.2 are fulfilled. Since $\pi^{(2)}$ is actually irreducible we may conclude by Theorem 7.2 that if $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$ then all of the corresponding integrals $I_{sij}$ in (7.5.11) must vanish. In the present case these integrals are the integrals

\[ I^{nn'll'}_{jmm'} \overset{\text{def}}{=} \int_{\mathbb{R}^3} p_j(x, y, z)\, \psi_{nlm}(x, y, z)\, \overline{\psi_{n'l'm'}(x, y, z)}\; dx\,dy\,dz, \tag{7.6.2} \]

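where $1 \le j \le 3$, $-l \le m \le l$, $-l' \le m' \le l'$. On the other hand, writing $\pi_k$ for the irreducible representation of $G$ of dimension $2k + 1$ (so that $\pi^{(1)} \cong \pi_l$, $\pi^{(2)} \cong \pi_{l'}$, $\pi^{(3)} \cong \pi_1$), the Clebsch-Gordan formula (Theorem 7.1) gives

\[ \pi_l \otimes \pi_1 \cong \begin{cases} \pi_{l+1} \oplus \pi_l \oplus \pi_{l-1} & \text{if } l \ge 1, \\ \pi_1 & \text{if } l = 0. \end{cases} \tag{7.6.3} \]

Therefore if $l \ge 1$, then $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$ if $l' \ne l + 1$, $l$, and $l - 1$, in which case

\[ I^{nn'll'}_{jmm'} = 0 \quad \text{for all } j, m, m'. \tag{7.6.4} \]

If $l = 0$, then $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$ if $l' \ne 1$, in which case

\[ I^{nn'0l'}_{jmm'} = 0 \quad \text{for all } j, m, m'. \tag{7.6.5} \]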



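Note that

\[ I^{nn'll'}_{jmm'} = \int_0^{\infty} \int_0^{\pi} \int_0^{2\pi} p_j(g(r, \theta, \phi))\, \psi_{nlm}(g(r, \theta, \phi))\, \overline{\psi_{n'l'm'}(g(r, \theta, \phi))}\; r^2 \sin\phi \; d\theta\, d\phi\, dr \tag{7.6.6} \]

with

\[
\begin{aligned}
p_1(g(r, \theta, \phi)) &= r \cos\theta \sin\phi, \\
p_2(g(r, \theta, \phi)) &= r \sin\theta \sin\phi, \\
p_3(g(r, \theta, \phi)) &= r \cos\phi;
\end{aligned}
\tag{7.6.7}
\]

see definition (4.A.2) in Appendix 4A.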

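Since $Y_l^m$ is homogeneous of degree $l$, $Y_l^m(-x, -y, -z) = (-1)^l\, Y_l^m(x, y, z)$. By (5.2.12) it follows therefore that

\[ \psi_{nlm}(-x, -y, -z) = (-1)^l\, \psi_{nlm}(x, y, z). \tag{7.6.8} \]

Since Lebesgue measure $dx\,dy\,dz$ is invariant under the transformation $(x, y, z) \to (-x, -y, -z)$ we can use (7.6.8) to deduce from definition (7.6.2) that

\[ I^{nn'll'}_{jmm'} = \int_{\mathbb{R}^3} p_j(-x, -y, -z)\, (-1)^{l + l'}\, \psi_{nlm}(x, y, z)\, \overline{\psi_{n'l'm'}(x, y, z)}\; dx\,dy\,dz = -(-1)^{l + l'}\, I^{nn'll'}_{jmm'}, \tag{7.6.9} \]

where the last equality holds because $p_j(-x, -y, -z) = -p_j(x, y, z)$. This shows that

\[ I^{nn'll'}_{jmm'} = 0 \quad \text{if } l + l' \text{ is even (i.e., if } l, l' \text{ have the same parity)}. \tag{7.6.10} \]

In particular if $l = l'$ (so that $l + l'$ is even), then

\[ I^{nn'll}_{jmm'} = 0. \tag{7.6.11} \]

Combining statements (7.6.4), (7.6.5), (7.6.11) one obtains

Theorem 7.3. If $l' \ne l + 1$ and $l - 1$, then the integrals $I^{nn'll'}_{jmm'}$ defined in (7.6.2) (also see (7.6.6), (7.6.7)) vanish for all $n, n', j, m, m'$; i.e., for $1 \le j \le 3$, $-l \le m \le l$, $-l' \le m' \le l'$, $n, n' \ge 1$, $0 \le l \le n - 1$, $0 \le l' \le n' - 1$.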

By (5.3.9) and (7.6.6) 



jnn'lV __ 
^ jmm' ~ 



jnn'll' 

^jmm' 



•00 


rf 


.0 . 


0 1 



Pj{g{r,e,4>)) R;,(r)^n,(0) x,^{<P) 



■ slii ^ dedd>dr. 



(7.6.12) 
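Consider the integration here with respect to $\theta$:

\[
\frac{1}{2\pi} \int_0^{2\pi} p_j(g(r, \theta, \phi))\, e^{i(m - m')\theta}\, d\theta =
\begin{cases}
\dfrac{r}{2\pi} (\sin\phi) \displaystyle\int_0^{2\pi} e^{i(m - m')\theta} \cos\theta \, d\theta & \text{for } j = 1, \\[2ex]
\dfrac{r}{2\pi} (\sin\phi) \displaystyle\int_0^{2\pi} e^{i(m - m')\theta} \sin\theta \, d\theta & \text{for } j = 2, \\[2ex]
\dfrac{r}{2\pi} (\cos\phi) \displaystyle\int_0^{2\pi} e^{i(m - m')\theta} \, d\theta & \text{for } j = 3.
\end{cases}
\tag{7.6.13}
\]

By the orthogonality relations

\[ \frac{1}{2\pi} \int_0^{2\pi} e^{i(p - q)\theta}\, d\theta = \delta_{pq} \tag{7.6.14} \]

for integers $p, q$, and the fact that $\cos\theta = (e^{i\theta} + e^{-i\theta})/2$, $\sin\theta = (e^{i\theta} - e^{-i\theta})/2i$, we deduce by (7.6.12), (7.6.13) the following.

Theorem 7.4. If $m' \ne m + 1$, $m - 1$, and $m$ then the integrals $I^{nn'll'}_{jmm'}$ defined in (7.6.2) (also see (7.6.6), (7.6.7)) vanish for all $n, n', l, l'$, and $j$. If $m' \ne m$, then $I^{nn'll'}_{3mm'} = 0$ for all $n, n', l, l'$. If $m' \ne m + 1$, $m - 1$ then both $I^{nn'll'}_{1mm'}, I^{nn'll'}_{2mm'} = 0$ for all $n, n', l, l'$.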
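The $\theta$-integrals behind Theorem 7.4 can be checked numerically. The following is a minimal sketch, assuming only that the $\theta$-dependence of the integrand is $c_j(\theta)\, e^{i(m - m')\theta}$ with $c_1 = \cos\theta$, $c_2 = \sin\theta$, $c_3 = 1$ as in (7.6.7) and (7.6.13); the function name `theta_integral` and the grid size are illustrative choices, not part of the text.

```python
import numpy as np


def theta_integral(j, m, m_prime, num=4096):
    """Approximate (1/2pi) * integral over [0, 2pi] of c_j(theta) * exp(i (m - m') theta).

    c_1 = cos(theta), c_2 = sin(theta), c_3 = 1, matching the theta-dependence
    of p_1, p_2, p_3 in (7.6.7). The mean over a uniform periodic grid is exact
    (up to rounding) for trigonometric polynomials of degree below `num`.
    """
    theta = np.linspace(0.0, 2.0 * np.pi, num, endpoint=False)
    factor = {1: np.cos(theta), 2: np.sin(theta), 3: np.ones_like(theta)}[j]
    integrand = factor * np.exp(1j * (m - m_prime) * theta)
    return integrand.mean()


if __name__ == "__main__":
    for j in (1, 2, 3):
        surviving = [
            (m, mp)
            for m in range(-2, 3)
            for mp in range(-2, 3)
            if abs(theta_integral(j, m, mp)) > 1e-9
        ]
        print(f"j = {j}: nonzero theta-integral for (m, m') in {surviving}")
```

For $j = 3$ only the pairs with $m' = m$ survive, and for $j = 1, 2$ only those with $m' = m \pm 1$, in line with the statement of the theorem.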



7.7 Time-Dependent Perturbations
and Spectroscopic Selection Rules



Our discussions regarding selection rules so far have been largely mathematical in 
character. To connect them more directly to physics we indicate how integrals like 
the $I^{nn'll'}_{jmm'}$ in (7.6.2) arise in connection with first-order time-dependent perturbation
theory. This theory serves as a platform from which spectroscopic selection
rules can be derived — i.e., rules by which one determines whether a transition 
from one quantum state to another is allowed or forbidden. Once some more gen- 
eral results are established we can specialize them to determine some allowed and 
forbidden transitions of the hydrogenic atom, given Theorems 7.3 and 7.4. 

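In quantum dynamics one considers transitions of a quantum system from one energy state to another, a process which is obviously time-dependent, and which is governed by the time-dependent Schrodinger equation

\[ i\hbar \frac{\partial \psi}{\partial t} = -\frac{\hbar^2}{2m} \nabla^2 \psi + V\psi = H_0 \psi \quad \text{for} \quad \nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}, \qquad H_0 = -\frac{\hbar^2}{2m} \nabla^2 + V, \tag{7.7.1} \]

where we work in $\mathbb{R}^3$ for the sake of specificity. Suppose we are given $n$ stationary states $\psi_1, \ldots, \psi_n : \mathbb{R}^3 \to \mathbb{C}$ (or even a countably infinite number of such states, as we have had in previous examples):

\[ H_0 \psi_j = E_j \psi_j, \qquad E_j \in \mathbb{R}. \tag{7.7.2} \]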







$E_j$ is the energy of the state $\psi_j$. If we set

\[ \Psi_j(x, y, z, t) = \psi_j(x, y, z)\, e^{-iE_j t/\hbar} \tag{7.7.3} \]

then, as observed in Chapter 1, we obtain solutions $\Psi_j$ of (7.7.1). The $\{\psi_j\}_j$ will be assumed to form an orthonormal set. Consider the case $n = 2$ for example. Here we have in mind a quantum system passing from the state $\Psi_1(x, y, z, t)$ to the state $\Psi_2(x, y, z, t)$. During this passage its wave function will be given by a superposition

\[ \Psi(x, y, z, t) = a_1(t)\, \Psi_1(x, y, z, t) + a_2(t)\, \Psi_2(x, y, z, t) \tag{7.7.4} \]

of the wave functions $\Psi_1, \Psi_2$ where the coefficients $a_1, a_2$ are time-dependent. The passage, or transition, could be triggered for example by the interaction of the system with electromagnetic radiation. Mathematically, this means that the function $\Psi$ will be subject to Schrodinger's equation of the form

\[ i\hbar \frac{\partial \Psi}{\partial t} = (H_0 + H_1)\Psi \tag{7.7.5} \]

where $H_1$ is a suitable real-valued energy function of position $(x, y, z)$ and time $t$: $H_1$ represents the perturbation of the initial state $\Psi_1(x, y, z, t)$ of the system. Thus the (time-dependent) Hamiltonian of the system is given by $H_0 + H_1$. Since $\Psi_1(x, y, z, t)$ is the initial state of the system we require that at $t = 0$ (before the perturbation is "switched on") $\Psi$ and $\Psi_1$ coincide; i.e.,

\[ \Psi(x, y, z, 0) = \Psi_1(x, y, z, 0) = \psi_1(x, y, z), \tag{7.7.6} \]

by (7.7.3).

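Let's consider more generally solutions $\Psi$ of (7.7.5) of the form

\[ \Psi(x, y, z, t) = \sum_{j=1}^{n} a_j(t)\, \Psi_j(x, y, z, t) \tag{7.7.7} \]

where the $\Psi_j, \psi_j$ are as before (with $\{\psi_j\}_j$ an orthonormal set), where $\Psi$ is subject to the initial condition (7.7.6). Now

\[
\begin{aligned}
\frac{\partial \Psi}{\partial t}(x, y, z, t)\, i\hbar
&= \sum_{j=1}^{n} \Big[ a_j(t) \frac{\partial \Psi_j}{\partial t}(x, y, z, t) + a_j'(t)\, \Psi_j(x, y, z, t) \Big] i\hbar \\
&= \sum_{j=1}^{n} a_j(t)\, H_0 \Psi_j(x, y, z, t) + \sum_{j=1}^{n} a_j'(t)\, \Psi_j(x, y, z, t)\, i\hbar
\end{aligned}
\tag{7.7.8}
\]

by (7.7.1), where by (7.7.5) the left-hand side here is

\[ (H_1 + H_0)\Psi(x, y, z, t) = H_1(x, y, z, t) \sum_{j=1}^{n} a_j(t)\, \Psi_j(x, y, z, t) + \sum_{j=1}^{n} a_j(t)\, H_0 \Psi_j(x, y, z, t) \tag{7.7.9} \]

since the $a_j$ do not involve $x, y, z$. Thus

\[ \sum_{j=1}^{n} a_j'(t)\, i\hbar\, \Psi_j(x, y, z, t) = H_1(x, y, z, t) \sum_{j=1}^{n} a_j(t)\, \Psi_j(x, y, z, t). \tag{7.7.10} \]

Note that by definition (7.7.3)

\[ \Psi_j(x, y, z, t)\, \overline{\psi_l(x, y, z)} = \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\, e^{-iE_j t/\hbar}. \tag{7.7.11} \]

Therefore since $\{\psi_j\}_j$ is an orthonormal set we multiply both sides of equation (7.7.10) by $\overline{\psi_l(x, y, z)}$ and integrate over $\mathbb{R}^3$ to obtain

\[ i\hbar\, a_l'(t)\, e^{-iE_l t/\hbar} = \sum_{j=1}^{n} a_j(t) \left[ \iiint_{\mathbb{R}^3} H_1(x, y, z, t)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz \right] e^{-iE_j t/\hbar}. \tag{7.7.12} \]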

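Given the perturbation $H_1$, its matrix $H(t)$ with respect to the set $\{\psi_j\}$ is defined by

\[ H_{jl}(t) \overset{\text{def}}{=} \iiint_{\mathbb{R}^3} H_1(x, y, z, t)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz = \big(H_1(\,\cdot\,; t)\psi_j, \psi_l\big). \tag{7.7.13} \]

$H(t)$ is Hermitian as $H_1$ is real-valued: $H_{jl}(t) = \overline{H_{lj}(t)}$. By (7.7.12) the coefficients $a_l(t)$, $1 \le l \le n$, therefore satisfy

\[
\begin{aligned}
a_l'(t) &= -\frac{i}{\hbar} \sum_{j=1}^{n} a_j(t)\, H_{jl}(t)\, e^{i(E_l - E_j)t/\hbar} \\
&= -\frac{i}{\hbar} \Big[ a_l(t)\, H_{ll}(t) + \sum_{j \ne l} a_j(t)\, H_{jl}(t)\, e^{i(E_l - E_j)t/\hbar} \Big].
\end{aligned}
\tag{7.7.14}
\]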



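Also by the initial condition (7.7.6)

\[
\begin{aligned}
\delta_{1l} &= \iiint_{\mathbb{R}^3} \psi_1(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz
 = \iiint_{\mathbb{R}^3} \Psi(x, y, z, 0)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz \\
&= \sum_{j=1}^{n} a_j(0) \iiint_{\mathbb{R}^3} \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz
 = \sum_{j=1}^{n} a_j(0)\, \delta_{jl} = a_l(0).
\end{aligned}
\tag{7.7.15}
\]

That is, $a_1(0) = 1$, $a_l(0) = 0$ for $l \ne 1$, which we use in (7.7.14) to obtain moreover that for $l \ne 1$

\[ a_l'(0) = -\frac{i}{\hbar} \Big[ \sum_{j \ne 1} a_j(0)\, H_{jl}(0) + a_1(0)\, H_{1l}(0) \Big] = -\frac{i}{\hbar} H_{1l}(0). \tag{7.7.16} \]

On the other hand, for $l = 1$,

\[ a_1'(0) = -\frac{i}{\hbar} \Big[ a_1(0)\, H_{11}(0) + \sum_{j \ne 1} a_j(0)\, H_{j1}(0) \Big] = -\frac{i}{\hbar} H_{11}(0). \tag{7.7.17} \]

That is, condition (7.7.6) implies

\[ a_1(0) = 1, \qquad a_l(0) = 0 \ \text{ for } l \ne 1, \qquad a_l'(0) = -\frac{i}{\hbar} H_{1l}(0) \ \text{ for all } l. \tag{7.7.18} \]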



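We assume $\Psi$ is normalized:

\[
1 = (\Psi, \Psi) = \sum_{j,l} a_j(t)\, \overline{a_l(t)}\, \big(\Psi_j(\,\cdot\,; t), \Psi_l(\,\cdot\,; t)\big)
 = \sum_{j,l} a_j(t)\, \overline{a_l(t)}\, e^{i(E_l - E_j)t/\hbar}\, \delta_{jl} = \sum_{j} |a_j(t)|^2
\tag{7.7.19}
\]

since

\[ \Psi_j(x, y, z, t)\, \overline{\Psi_l(x, y, z, t)} = \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\, e^{i(E_l - E_j)t/\hbar} \tag{7.7.20} \]

by (7.7.3). The moduli $|a_j(t)|$ in (7.7.19) are given as follows.

\[
\begin{aligned}
\big(\Psi(\,\cdot\,; t), \psi_l\big) &= \iiint_{\mathbb{R}^3} \Psi(x, y, z, t)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz \\
&= \sum_{j} a_j(t)\, e^{-iE_j t/\hbar} \iiint_{\mathbb{R}^3} \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz \\
&= \sum_{j} a_j(t)\, e^{-iE_j t/\hbar}\, \delta_{jl} = a_l(t)\, e^{-iE_l t/\hbar}
\end{aligned}
\tag{7.7.21}
\]

by (7.7.7), and (7.7.11), which implies that

\[ |a_l(t)|^2 = \big|\big(\Psi(\,\cdot\,; t), \psi_l\big)\big|^2, \tag{7.7.22} \]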







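$\Psi(\,\cdot\,; t)$ being the function $(x, y, z) \to \Psi(x, y, z, t)$. In (7.7.7) we may write

\[ \Psi(\,\cdot\,; t) = \sum_{j} a_j(t)\, e^{-iE_j t/\hbar}\, \psi_j \tag{7.7.23} \]

by (7.7.3), where by (7.7.2)

\[ H_0 \Psi(\,\cdot\,; t) = \sum_{j} a_j(t)\, e^{-iE_j t/\hbar}\, E_j\, \psi_j. \tag{7.7.24} \]

In other words (7.7.23) is a type of "Fourier expansion" of $\Psi(\,\cdot\,; t)$ in terms of eigenfunctions $\psi_j$ of $H_0$ (with corresponding eigenvalue $E_j$), with the $a_j(t)$ appearing as Fourier coefficients. Using (7.7.19), (7.7.24) we can discover the physical meaning of the $|a_j(t)|^2$. To this end consider the average or expected value $\langle E \rangle$ of the energy of our system at time $t$. By definition (3.9.2)

\[ \langle E \rangle = \big(H_0 \Psi(\,\cdot\,; t), \Psi(\,\cdot\,; t)\big) \tag{7.7.25} \]

as $H_0$ is the operator corresponding to the "observable" $E$, and as $\Psi$ is normalized; cf. definitions (6.2.6), (6.2.7). Therefore by (7.7.23), (7.7.24)

\[ \langle E \rangle = \sum_{j,l} a_j(t)\, \overline{a_l(t)}\, E_j\, e^{i(E_l - E_j)t/\hbar}\, \delta_{jl} = \sum_{j} |a_j(t)|^2 E_j. \tag{7.7.26} \]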

But if one is given a set of "scores" $E_1, E_2, \ldots$ with frequencies $f_1, f_2, \ldots$, then the average value $\bar{E}$ of these scores in the ordinary sense is

\[ \bar{E} = \frac{\sum_j f_j E_j}{\sum_j f_j} = \sum_{j} p_j E_j \tag{7.7.27} \]

where $p_j = f_j / \sum_l f_l$ is the probability that a given score has the value $E_j$.

Comparing (7.7.26) with (7.7.27) we are naturally led to regard each $|a_j(t)|^2$ as a probability (= a frequency of the "occurrence of $E_j$," by (7.7.19)). Namely, $|a_j(t)|^2$ is the probability that a measurement of the energy of the system at time $t$ will yield the value $E_j$. We also say (somewhat less precisely) that $|a_j(t)|^2$ is the probability of finding the system in the state $\psi_j$ at time $t$. The conclusion on the meaning of the $|a_j(t)|^2$ also follows by general principles of quantum mechanics. Compare the remarks following equation (3.9.10). Rather than a systematic development of such principles we have chosen to illustrate the same in specific situations.

For physical reasons we see therefore that it is of great importance to compute the $|a_j(t)|^2$, say for $j \ge 2$. However there is no general procedure to obtain solutions of the system of linear differential equations in (7.7.14), even when $n = 2$. Thus one has no choice but to look for approximate solutions. Here we apply the method of first-order time-dependent perturbation theory. The idea here is very simple: From the initial conditions $a_1(0) = 1$, $a_l(0) = 0$ for $l \ne 1$ in (7.7.18) we look for approximate solutions $a_l(t)$, say for very small $t$, by replacing the $a_l(t)$ in (7.7.14) by their initial values. Thus, were we to take $a_1(t) = 1$, $a_l(t) = 0$ for $l \ne 1$, for small $t$, equations (7.7.14) would assume the following simpler form.

\[ a_1'(t) \approx -\frac{i}{\hbar} \big[ H_{11}(t) + 0 \big] \tag{7.7.28} \]

and for $l \ne 1$

\[
\begin{aligned}
a_l'(t) &\approx -\frac{i}{\hbar} \sum_{j \ne l} a_j(t)\, H_{jl}(t)\, e^{i(E_l - E_j)t/\hbar}
 = -\frac{i}{\hbar} \Big[ \text{term for } j = 1 + \sum_{j \ne 1, l} a_j(t)\, H_{jl}(t)\, e^{i(E_l - E_j)t/\hbar} \Big] \\
&= -\frac{i}{\hbar} (\text{term for } j = 1) = -\frac{i}{\hbar} H_{1l}(t)\, e^{i(E_l - E_1)t/\hbar}.
\end{aligned}
\tag{7.7.29}
\]

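We take the approximations in (7.7.28), (7.7.29) as motivation for the following formal definitions of functions $a_l^{(1)}(t)$, $1 \le l \le n$, where the superscript $(1)$ denotes first-order approximation, not the first derivative:

\[ a_1^{(1)}(t) \overset{\text{def}}{=} -\frac{i}{\hbar} \int_0^t H_{11}(\tau)\, d\tau + 1, \quad \text{and} \tag{7.7.30} \]

\[ a_l^{(1)}(t) \overset{\text{def}}{=} -\frac{i}{\hbar} \int_0^t H_{1l}(\tau)\, e^{i\omega_{l1}\tau}\, d\tau \quad \text{for } l \ne 1. \tag{7.7.31} \]

The constant 1 is added in the definition of $a_1^{(1)}(t)$ to secure the initial conditions $a_1^{(1)}(0) = 1$, $a_l^{(1)}(0) = 0$ for $l \ne 1$, as in (7.7.18). By definition

\[ a_1^{(1)\prime}(t) = -\frac{i}{\hbar} H_{11}(t), \quad \text{and} \tag{7.7.32} \]

\[ a_l^{(1)\prime}(t) = -\frac{i}{\hbar} H_{1l}(t)\, e^{i\omega_{l1}t} \quad \text{for } l \ne 1 \tag{7.7.33} \]

where we now write

\[ \omega_{lj} = (E_l - E_j)/\hbar. \tag{7.7.34} \]

The $\omega_{lj}$ are called Bohr frequencies. In particular we also have

\[ a_l^{(1)\prime}(0) = -\frac{i}{\hbar} H_{1l}(0) \quad \text{for all } l, \tag{7.7.35} \]

as in (7.7.18). Since we used a rough approximation argument to motivate the definition of the $a_l^{(1)}(t)$, and also since we have already noted the system in (7.7.14) cannot be solved in general, the $a_l^{(1)}(t)$ cannot be expected to solve (7.7.14). We would like to understand however to what extent they provide an approximate solution. Let

\[ \Delta_l(t) \overset{\text{def}}{=} a_l^{(1)\prime}(t) + \frac{i}{\hbar} \sum_{j} a_j^{(1)}(t)\, H_{jl}(t)\, e^{i\omega_{lj}t}. \tag{7.7.36} \]

Then $a_l^{(1)}(t)$ solves (7.7.14) $\iff$ $\Delta_l(t) = 0$. The point is to give a bound for $\Delta_l(t)$.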



Proposition 7.2.

\[ |\Delta_l(t)| \le \frac{1}{\hbar^2} \sum_{j} |H_{jl}(t)| \int_0^t |H_{1j}(\tau)|\, d\tau. \]

Hence $\Delta_l(t) \to 0$ as $t \to 0$.

Proposition 7.2 is proved in Appendix 7B to this chapter. We will mainly be interested in the case when $H_1$ is a harmonic perturbation:

\[ H_1(x, y, z, t) = V_1(x, y, z) \cos\omega t \tag{7.7.37} \]

for some potential energy function $V_1$. If

\[ V_{jl} \overset{\text{def}}{=} \iiint_{\mathbb{R}^3} V_1(x, y, z)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz, \tag{7.7.38} \]

then by definition (7.7.13)

\[ H_{jl}(t) = V_{jl} \cos\omega t \tag{7.7.39} \]

for $H_1$ in (7.7.37), and in this case

\[ |\Delta_l(t)| \le \frac{1}{\hbar^2} \sum_{j} |V_{jl}|\, |V_{1j}|\, t \tag{7.7.40} \]

by Proposition 7.2. We see that in some sense the $a_l^{(1)}$ are justifiably first-order approximate solutions of (7.7.14). One can obtain better approximations (second, third-order, etc.), which we will not go into since the first-order approximations (first-order perturbation theory) suffice indeed for a wide range of practical applications.
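To get a feel for how good the first-order approximation is, one can integrate the system (7.7.14) numerically and compare with $a_l^{(1)}(t)$. The following is a minimal sketch under stated assumptions: units with $\hbar = 1$, a hypothetical two-level system with a small real symmetric matrix $V$, a harmonic perturbation as in (7.7.39), and a classical fourth-order Runge-Kutta step. All names and numerical values are illustrative and do not come from the text.

```python
import numpy as np

HBAR = 1.0  # assumption: units in which hbar = 1


def rhs(t, a, energies, V, omega):
    """Right-hand side of (7.7.14) with H_jl(t) = V_jl * cos(omega * t)."""
    # phase[j, l] = exp(i (E_l - E_j) t / hbar)
    phase = np.exp(1j * (energies[None, :] - energies[:, None]) * t / HBAR)
    H = V * np.cos(omega * t)
    # a'_l = -(i/hbar) * sum_j a_j H_jl exp(i (E_l - E_j) t / hbar)
    return (-1j / HBAR) * (a @ (H * phase))


def solve_coefficients(energies, V, omega, t_end, steps=2000):
    """Classical RK4 integration of (7.7.14) starting from a(0) = (1, 0, ..., 0)."""
    a = np.zeros(len(energies), dtype=complex)
    a[0] = 1.0  # index 0 plays the role of the initial state psi_1
    h = t_end / steps
    t = 0.0
    for _ in range(steps):
        k1 = rhs(t, a, energies, V, omega)
        k2 = rhs(t + h / 2, a + h / 2 * k1, energies, V, omega)
        k3 = rhs(t + h / 2, a + h / 2 * k2, energies, V, omega)
        k4 = rhs(t + h, a + h * k3, energies, V, omega)
        a = a + (h / 6) * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return a


def first_order_coefficient(energies, V, omega, t, l):
    """a_l^(1)(t) for l != 0, i.e. (7.7.31) with H_1l from (7.7.39), integrated exactly.

    Requires omega != +omega_l1 and omega != -omega_l1.
    """
    w = (energies[l] - energies[0]) / HBAR  # Bohr frequency omega_{l1}
    plus = (np.exp(1j * (omega + w) * t) - 1) / (omega + w)
    minus = (np.exp(1j * (w - omega) * t) - 1) / (w - omega)
    return -V[0, l] / (2 * HBAR) * (plus + minus)


# Hypothetical two-level data, chosen only for illustration.
ENERGIES = np.array([0.0, 1.0])
V_MATRIX = 0.01 * np.array([[0.0, 1.0], [1.0, 0.0]])
OMEGA = 0.7
T_END = 5.0

if __name__ == "__main__":
    exact = solve_coefficients(ENERGIES, V_MATRIX, OMEGA, T_END)
    approx = first_order_coefficient(ENERGIES, V_MATRIX, OMEGA, T_END, 1)
    print("numerical  a_2(t):", exact[1])
    print("first-order a_2(t):", approx)
    print("difference:", abs(exact[1] - approx))
```

For a weak perturbation the two values agree closely, while the sum of the $|a_j(t)|^2$ from the numerical solution stays at 1 up to integration error, in keeping with (7.7.19).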

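Having noted the physical importance of the $|a_l^{(1)}(t)|^2$ we come now to the main theorem of this section.

Theorem 7.5. For the harmonic perturbation $H_1$ in (7.7.37), and the corresponding matrix entries $V_{jl}$ in (7.7.38) one has for $l \ne 1$,

\[
|a_l^{(1)}(t)|^2 = \frac{|V_{1l}|^2}{\hbar^2} \left[ \frac{\sin^2(\omega + \omega_{l1})t/2}{(\omega + \omega_{l1})^2} + \frac{\sin^2(\omega_{l1} - \omega)t/2}{(\omega_{l1} - \omega)^2} + 2(\cos\omega t) \left( \frac{\sin(\omega_{l1} - \omega)t/2}{\omega_{l1} - \omega} \right) \left( \frac{\sin(\omega_{l1} + \omega)t/2}{\omega_{l1} + \omega} \right) \right]
\]

where the Bohr frequencies $\omega_{lj}$ are defined by (7.7.34).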

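Proof. For $l \ne 1$, $a_l^{(1)}(t)$ is given by definition (7.7.31) and equation (7.7.39):

\[
a_l^{(1)}(t) = -\frac{i}{\hbar} \int_0^t (\cos\omega\tau)\, V_{1l}\, e^{i\omega_{l1}\tau}\, d\tau
 = -\frac{V_{1l}}{2\hbar} \left[ \frac{e^{i(\omega + \omega_{l1})t} - 1}{\omega + \omega_{l1}} + \frac{e^{i(\omega_{l1} - \omega)t} - 1}{\omega_{l1} - \omega} \right]
\tag{7.7.41}
\]

where we use that

\[ 2\cos\omega\tau = e^{i\omega\tau} + e^{-i\omega\tau}. \tag{7.7.42} \]

Then multiplying $a_l^{(1)}(t)$ by $\overline{a_l^{(1)}(t)}$ and using (7.7.42) again we obtain

\[
\begin{aligned}
|a_l^{(1)}(t)|^2 &= \frac{|V_{1l}|^2}{4\hbar^2} \Bigg[ \frac{2 - e^{-i(\omega + \omega_{l1})t} - e^{i(\omega + \omega_{l1})t}}{(\omega + \omega_{l1})^2} + \frac{2 - e^{-i(\omega_{l1} - \omega)t} - e^{i(\omega_{l1} - \omega)t}}{(\omega_{l1} - \omega)^2} \\
&\qquad\qquad + \frac{e^{2i\omega t} - e^{i(\omega + \omega_{l1})t} - e^{-i(\omega_{l1} - \omega)t} + 1}{(\omega_{l1} - \omega)(\omega + \omega_{l1})} + \frac{e^{-2i\omega t} - e^{i(\omega_{l1} - \omega)t} - e^{-i(\omega + \omega_{l1})t} + 1}{(\omega_{l1} - \omega)(\omega + \omega_{l1})} \Bigg] \\
&= \frac{|V_{1l}|^2}{4\hbar^2} \Bigg[ \frac{2 - 2\cos(\omega + \omega_{l1})t}{(\omega + \omega_{l1})^2} + \frac{2 - 2\cos(\omega_{l1} - \omega)t}{(\omega_{l1} - \omega)^2} \\
&\qquad\qquad + \frac{2\cos 2\omega t - 2\cos(\omega + \omega_{l1})t - 2\cos(\omega_{l1} - \omega)t + 2}{\omega_{l1}^2 - \omega^2} \Bigg].
\end{aligned}
\tag{7.7.43}
\]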

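To simplify the latter expression we bring in some trigonometric identities. $2\sin^2\frac{\theta}{2} = 1 - \cos\theta$ $\Longrightarrow$

\[
\frac{2 - 2\cos(\omega + \omega_{l1})t}{(\omega + \omega_{l1})^2} = \frac{4\sin^2(\omega + \omega_{l1})t/2}{(\omega + \omega_{l1})^2}, \qquad
\frac{2 - 2\cos(\omega_{l1} - \omega)t}{(\omega_{l1} - \omega)^2} = \frac{4\sin^2(\omega_{l1} - \omega)t/2}{(\omega_{l1} - \omega)^2}.
\tag{7.7.44}
\]

Also

\[ 2\cos 2\omega t + 2 = 2(1 + \cos 2\omega t) = 4\cos^2\omega t. \tag{7.7.45} \]

Next

\[ \sin\left(\frac{a - b}{2}\right) \sin\left(\frac{a + b}{2}\right) = \frac{\cos b - \cos a}{2} \tag{7.7.46} \]

\[ \Longrightarrow \quad 2(\cos b) \sin\left(\frac{a - b}{2}\right) \sin\left(\frac{a + b}{2}\right) = \cos^2 b - \cos b \cos a \tag{7.7.47} \]

and

\[ \cos x + \cos y = 2\cos\left(\frac{x + y}{2}\right) \cos\left(\frac{x - y}{2}\right) \tag{7.7.48} \]

\[ \Longrightarrow \quad 2\cos(\omega + \omega_{l1})t + 2\cos(\omega_{l1} - \omega)t = 4(\cos\omega_{l1}t)(\cos\omega t). \tag{7.7.49} \]

The numerator of the third term in (7.7.43) is therefore rewritten as follows:

\[
\begin{aligned}
2\cos 2\omega t - 2\cos(\omega + \omega_{l1})t - 2\cos(\omega_{l1} - \omega)t + 2
&= 4\cos^2\omega t - 4(\cos\omega_{l1}t)(\cos\omega t) \quad \text{by (7.7.45) and by (7.7.49)} \\
&= 8(\cos\omega t) \sin\big((\omega_{l1} - \omega)t/2\big) \sin\big((\omega_{l1} + \omega)t/2\big),
\end{aligned}
\tag{7.7.50}
\]

by (7.7.47). Theorem 7.5 now follows by (7.7.43), (7.7.44), and (7.7.50). □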
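The closed form in Theorem 7.5 can be sanity-checked against a direct numerical evaluation of the defining integral (7.7.31). The sketch below assumes $\hbar = 1$ by default and uses a hand-written composite trapezoidal rule; the function names and the sample parameter values are illustrative only.

```python
import numpy as np


def prob_closed_form(V1l, omega, omega_l1, t, hbar=1.0):
    """|a_l^(1)(t)|^2 as given by Theorem 7.5 (requires omega != +-omega_l1)."""
    s_plus = np.sin((omega + omega_l1) * t / 2) / (omega + omega_l1)
    s_minus = np.sin((omega_l1 - omega) * t / 2) / (omega_l1 - omega)
    bracket = s_plus**2 + s_minus**2 + 2 * np.cos(omega * t) * s_minus * s_plus
    return abs(V1l) ** 2 / hbar**2 * bracket


def prob_quadrature(V1l, omega, omega_l1, t, hbar=1.0, num=20001):
    """|a_l^(1)(t)|^2 from (7.7.31) with H_1l(tau) = V_1l cos(omega tau), by quadrature."""
    tau = np.linspace(0.0, t, num)
    integrand = np.cos(omega * tau) * np.exp(1j * omega_l1 * tau)
    dt = tau[1] - tau[0]
    # composite trapezoidal rule, written out explicitly
    integral = dt * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    a = (-1j / hbar) * V1l * integral
    return abs(a) ** 2


if __name__ == "__main__":
    # Hypothetical sample values, chosen away from resonance.
    args = (0.3 + 0.1j, 1.3, 2.0, 4.0)
    print("closed form:", prob_closed_form(*args))
    print("quadrature :", prob_quadrature(*args))
```

Near resonance ($\omega \to \pm\omega_{l1}$) the closed form has a removable singularity, so a production version would need to treat that limit separately; this sketch does not.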



The general harmonic perturbation H\ in (7.7.37) will now be specialized. We 
consider an atom subject to an electromagnetic wave. It will experience a change 
in its energy. It is known that the magnetic component of the wave will bear 
less influence on the atom. Any appreciable influence will therefore be due to the 
electric component. As illustrated in Figure 7.1 we take, for example, the direction 
of propagation of the wave to be along the positive y-axis, with the electric vector 
E pointing in the z-direction. 

The magnitude of $E$ at time $t$ and position $(x, y, z)$ is determined by a traveling wave expression

\[ E(x, y, z, t) = E_0 \cos(\kappa y - \omega t) \tag{7.7.51} \]

where $\lambda$ is the wavelength, $\kappa = \frac{2\pi}{\lambda}$ is the wave number, $\nu = \frac{\omega}{2\pi}$ is the frequency and $E_0$ is the harmonic amplitude of the wave; compare equation (1.4.7) and the basic definitions given there. If $y/\lambda$ is sufficiently small, $E(x, y, z, t)$ is well approximated via initial Maclaurin series terms:

\[ E(x, y, z, t) \approx E_0 \left[ \cos\omega t + \frac{2\pi y}{\lambda} \sin\omega t - \frac{4\pi^2 y^2}{2\lambda^2} \cos\omega t \right]. \tag{7.7.52} \]
Figure 7.1: An electromagnetic wave propagated in the y-direction. The electric
and magnetic field vectors are perpendicular to each other and to the direction in
which the wave is propagated.



If for example $\lambda$ is approximately of the order of a few thousand Angstroms (1 Angstrom = $10^{-8}$ cm), as in the case of visible light, then since any distance $y$ measured within the atom would roughly be of the order $10^{-8}$ cm = 1 Angstrom, $y/\lambda$ would be rather small, in which case one could reasonably choose the approximation

\[ E(x, y, z, t) \approx E_0 \cos\omega t, \tag{7.7.53} \]

neglecting the powers $\frac{y}{\lambda}$, $\frac{y^2}{\lambda^2}$ in (7.7.52). Equation (7.7.53) means that there is effectively a uniform distribution of the electric field over the atom. One further point is to be made, a point which we will not attempt to develop here: the energy $H_1$, or Hamiltonian, of interaction of the atom and the electric field will be given by

\[ H_1(x, y, z, t) = ezE(x, y, z, t) \tag{7.7.54} \]

where as before $-e$ is the electron charge. See remarks in Appendix 7D. In other words, in summary, an atom which is suddenly subjected to an electromagnetic wave (i.e., to interaction with electromagnetic radiation), with a large wavelength $\lambda$ in comparison with the dimensions of that atom, will experience in addition to its normal energy a perturbing energy of the amount $H_1(x, y, z, t) = ezE_0 \cos\omega t$, approximately (by (7.7.53), (7.7.54)), where $\nu = \frac{\omega}{2\pi}$ is the wave frequency, and $E_0$ is the harmonic amplitude of the electric component of the wave, whose electric vectors point in the z-direction (as in Figure 7.1). Thus, up to a reasonable approximation, $H_1(x, y, z, t)$ has the form (7.7.37), where

\[ V_1(x, y, z) = ezE_0 \tag{7.7.55} \]



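in which case, by definition (7.7.38),

\[
V_{jl} = V_{jl}^{z} \overset{\text{def}}{=} eE_0 \iiint_{\mathbb{R}^3} z\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz
 = eE_0 \iiint_{\mathbb{R}^3} p_3(x, y, z)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz,
\tag{7.7.56}
\]

where again $p_1, p_2, p_3 : \mathbb{R}^3 \to \mathbb{R}$ are projections onto the first, second, third coordinates, respectively.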

In the preceding discussion the electric field was assumed to be polarized in the z-direction: the electric vectors were assumed to point in the z-direction. We were thus led to the approximation $H_1(x, y, z, t) = ezE_0 \cos\omega t$ (which is independent of $y$, i.e., of the originally assumed direction of propagation). If the electric vectors of the wave point in the x-direction, or in the y-direction we will have approximate perturbing energies $H_1(x, y, z, t) = exE_0 \cos\omega t$ or $eyE_0 \cos\omega t$ with corresponding matrix entries

\[ V_{jl} = V_{jl}^{x} \overset{\text{def}}{=} eE_0 \iiint_{\mathbb{R}^3} p_1(x, y, z)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz, \tag{7.7.57} \]

\[ V_{jl} = V_{jl}^{y} \overset{\text{def}}{=} eE_0 \iiint_{\mathbb{R}^3} p_2(x, y, z)\, \psi_j(x, y, z)\, \overline{\psi_l(x, y, z)}\; dx\,dy\,dz, \tag{7.7.58} \]

respectively. We now see that the integrals $I^{nn'll'}_{jmm'}$ in (7.6.2) (which initially arose as examples of the integrals $I_{sij}$ in (7.5.11)) appear as special cases of the integrals in (7.7.56), (7.7.57), (7.7.58).

From Theorem 7.5 various deductions can be made. 

(i) The “transition probability” $|a_l^{(1)}(t)|^2$ as a function of $\omega$ attains its maximal value at the points $\omega = \pm\omega_{l1} \overset{\text{def}}{=} \pm(E_l - E_1)/\hbar$. This result has the following meaning. The atom will not likely absorb or emit radiation in the transition from the initial state $\psi_1$ to the state $\psi_l$ unless $\omega = \pm\omega_{l1}$, i.e., unless $E_l = E_1 \pm \hbar\omega = E_1 \pm h\nu$ (as $\nu = \omega/2\pi$), which is the Bohr frequency rule; compare equation (1.2.1). In the absorption process the atom absorbs energy from the electromagnetic field in the amount $E_l - E_1 = h\nu$. In the emission process energy of the amount $E_1 - E_l = h\nu$ is added to the field.

(ii) $|a_l^{(1)}(t)|^2$ is proportional to the modulus squared $|V_{1l}|^2$ of the perturbation matrix entry $V_{1l}$ (which actually follows by (7.7.43)), a fact which is taken as axiomatic in Heisenberg’s version of quantum mechanics (in his matrix mechanics). In (7.7.56), (7.7.57), (7.7.58) $|V_{1l}|^2$ in turn is proportional to the modulus squared $|E_0|^2$ of the harmonic amplitude $E_0$ of the electric vector. The transition probability $|a_l^{(1)}(t)|^2$ is therefore proportional to the intensity of the radiation.

(iii) If the matrix entry $V_{1l}$ vanishes, then of course $|a_l^{(1)}(t)|^2$ vanishes, which means that a transition from the initial state $\psi_1$ to the state $\psi_l$ is essentially forbidden. We have employed certain approximations in the foregoing analysis. Experiments show that “forbidden transitions” might actually occur, but such occurrences are very rare, and such transitions produce very faint spectral lines. The term “forbidden” therefore is taken to mean most likely forbidden, by far, but not necessarily to mean that a transition is absolutely forbidden.

(iv) Theorem 7.5, up to first-order approximation (which is usually sufficient for many practical problems), provides a formula for the probability $|a_l^{(1)}(t)|^2$ of finding a quantum system, subject to a harmonic perturbation $H_1$, in the state $\psi_l$ (different from the initial state $\psi_1$) at time $t$; that is, as pointed out earlier, we have a formula for the probability that a measurement of the energy of the system at time $t$ will yield the energy value $E_l$.

(v) Embodied in Theorem 7.5 is not only a basis by which certain spectroscopic selection rules are derived, and a means for determining the probability of a transition, but the theorem also serves as a basis by which certain polarization rules might be derived. For example, suppose that both $V_{1l}^{z}$ and $V_{1l}^{x}$ in (7.7.56), (7.7.57) vanish. Then in the transition $\psi_1 \to \psi_l$ only radiation polarized in the y-direction could be absorbed or emitted, i.e., radiation with a specific polarization.

7.8 Spectroscopic Selection Rules 
for the Hydrogenic Atom 

We apply the results of Section 7.7 to a hydrogenic atom interacting with electromagnetic radiation. To begin with a specific example, we consider the case when the electric field is polarized in the z-direction. A matrix entry $V_{jl}^{z}$ in (7.7.56) then has the form $eE_0 I^{nn'll'}_{3mm'}$ by definition (7.6.2). By Theorem 7.3

\[ I^{nn'll'}_{jmm'} = 0 \ \text{ for all } n, n', j, m, m' \ \text{ unless } l' = l + 1 \text{ or } l - 1. \tag{7.8.1} \]

By Theorem 7.4, $I^{nn'll'}_{3mm'} = 0$ unless $m' = m$. The following conclusion (selection rule) is reached: For the electromagnetic wave polarized in the z-direction a transition $\psi_{nlm} \to \psi_{n'l'm'}$ (from the state $\psi_{nlm}$ to the state $\psi_{n'l'm'}$) is forbidden unless $l' = l + 1$ or $l - 1$ and $m' = m$. For example, in the beginning of this chapter we asked whether the transition $(5, 2, -2) \to (5, 4, -2)$ (i.e., $\psi_{52(-2)} \to \psi_{54(-2)}$) was always physically possible, and we pointed out that the answer was no, as would be seen. We now see why, in the present context, the answer is “no”: $l = 2$, $l' = 4$ $\Longrightarrow$ $l' \ne l + 1$ and $l' \ne l - 1$, even though $m' = m\ (= -2)$. It is customary to express the condition $l' = l + 1$ or $l - 1$ as $\Delta l = \pm 1$; similarly the condition $m' = m$ is expressed as $\Delta m = 0$. Thus for the wave polarized in the z-direction, the hydrogenic atom can absorb or emit radiation only if $\Delta l = \pm 1$ and $\Delta m = 0$.

Suppose next that the electric wave is polarized in either the x or y-direction. Then $V_{jl}^{x}$, $V_{jl}^{y}$ in (7.7.57), (7.7.58) have the form $eE_0 I^{nn'll'}_{1mm'}$, $eE_0 I^{nn'll'}_{2mm'}$ respectively, where by Theorem 7.4 both $I^{nn'll'}_{1mm'}$, $I^{nn'll'}_{2mm'}$ vanish unless $m' = m + 1$ or $m - 1$, i.e., unless $\Delta m = \pm 1$. By (7.7.57), (7.7.58) we therefore obtain the selection rules $\Delta l = \pm 1$, $\Delta m = \pm 1$ for electric waves polarized in either the x or y direction. If $\Delta l = \pm 1$ and $\Delta m = \pm 1$, so that the transition $\psi_{nlm} \to \psi_{n'l'm'}$ is allowed, then
by the remarks in (v) of Section 7.7 only radiation of a definite polarization can 
be absorbed or emitted — namely (in the present case) radiation whose electric 
vectors point in the ^-direction. 
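The selection rules of this section amount to a simple test on the quantum numbers. The following sketch encodes them as stated above: $\Delta l = \pm 1$ in all cases, $\Delta m = 0$ for z-polarization, and $\Delta m = \pm 1$ for x- or y-polarization. The function name and the string encoding of the polarization are illustrative assumptions. Note that the rules only say when a transition is forbidden to first order; passing the test does not by itself guarantee a nonzero matrix entry.

```python
def passes_selection_rules(l, m, l_prime, m_prime, polarization):
    """Return False when the transition psi_{nlm} -> psi_{n'l'm'} is forbidden
    by the first-order selection rules for the given polarization ('x', 'y' or 'z').

    The principal quantum numbers n, n' do not enter these rules.
    """
    if polarization not in ("x", "y", "z"):
        raise ValueError("polarization must be 'x', 'y' or 'z'")
    if abs(l_prime - l) != 1:  # Theorem 7.3: need Delta l = +-1
        return False
    delta_m = m_prime - m
    if polarization == "z":  # Theorem 7.4 with j = 3: need Delta m = 0
        return delta_m == 0
    return abs(delta_m) == 1  # Theorem 7.4 with j = 1, 2: need Delta m = +-1


if __name__ == "__main__":
    # The example from the text: (5, 2, -2) -> (5, 4, -2), z-polarized wave.
    print(passes_selection_rules(2, -2, 4, -2, "z"))
```

For the example $(5, 2, -2) \to (5, 4, -2)$ the function reports that the transition is forbidden, since $\Delta l = 2$.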

7.9 A General Quantum Mechanical Selection Rule 

The abstract selection rule of Theorem 7.2 was formulated in terms of a compact group $G$ acting on a locally compact space $X$ and the corresponding unitary representation of $G$ on $L^2(X)$, or on $C_2(X)$. It is possible, and useful, to formulate a selection rule even a bit more abstract (and simpler), where the Hilbert space $L^2(X)$ is replaced by a general Hilbert space $H$. This we do in Theorem 7.6 below. The basic new idea needed for this formulation is that of a tensor operator, to use the terminology of physicists. In practice $H$ will be the Hilbert space of some quantum mechanical system. Thus it is appropriate to refer to Theorem 7.6 as a quantum mechanical selection rule.

Again let $G$ be a compact group as above, let $\pi$ be a continuous unitary representation of $G$ on a complex Hilbert space $H$, and let $\pi^{(3)}$ be a continuous representation of $G$ on a finite-dimensional vector space $V_3$. The use of the notation $\pi^{(3)}$ and use of later notation will be for the sake of easy comparison with that of Theorem 7.2. Let $B(H)$ denote the space of bounded linear operators $A$ on $H$: $\|Aa\| \le M\|a\|$ for some $M > 0$ where the norm $\|a\|$ of a vector $a \in H$ is induced by the inner product $( \, , \, )$ on $H$: $\|a\|^2 = (a, a)$. A tensor operator $T$, relative to $\pi$, $\pi^{(3)}$, is a linear map $T : V_3 \to B(H)$ such that

\[ T(\pi^{(3)}(g)v) = \pi(g)\, T(v)\, \pi(g^{-1}) \quad \text{for } (g, v) \in G \times V_3. \tag{7.9.1} \]

In regard to such operators, one has (for the above notation)

Theorem 7.6 (A General Quantum Mechanical Selection Rule). Let $T : V_3 \to B(H)$ be a tensor operator relative to $\pi$, $\pi^{(3)}$. Let $V_1, V_2 \subset H$ be finite-dimensional $\pi$-invariant subspaces, and let $\pi^{(1)}, \pi^{(2)}$ be the corresponding continuous subrepresentations of $G$ on $V_1, V_2$. If the trivial one-dimensional representation $1$ of $G$ is not contained in $\pi^{(1)} \otimes \pi^{(3)} \otimes \pi^{(2)*}$ then

\[ (T(v_3)v_1, v_2) = 0 \quad \text{for all } (v_1, v_2, v_3) \in V_1 \times V_2 \times V_3. \tag{7.9.2} \]

If $\pi^{(2)}$ is actually irreducible the conclusion (7.9.2) is still valid provided that $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$.

It is clear by our choice of notation that Theorem 7.6 compares indeed with Theorem 7.2, where the integrals $I_{sij}$ there (given by the inner products in (7.5.11)) are replaced by the inner products in (7.9.2).

As an example, suppose in fact that in Section 7.5 one has that for each $f \in V_3 \overset{\text{def}}{=} G \cdot \phi$, the corresponding multiplication operator $M_f$ ($f_1 \to f f_1$, $f_1 \in L^2(X, dx)$) belongs to $B(L^2(X, dx))$. Then we have a linear map

\[ T : V_3 \to B(L^2(X, dx)) \quad \text{given by} \quad T(f) \overset{\text{def}}{=} M_f. \tag{7.9.3} \]

The identity $g \cdot [f\,(g^{-1} \cdot f_1)] = (g \cdot f) f_1$ (which was also used to establish (7.5.9)) translates to the statement that $T$ is a tensor operator relative to $\pi$, $\pi^{(3)}$. Regarding $V_1, V_2 \subset C_2(X)$ in Section 7.5 as finite-dimensional $\pi$-invariant subspaces of $H = L^2(X, dx)$, we conclude from Theorem 7.6 that

\[ \int_X h(x)\, f_1(x)\, \overline{f_2(x)}\, dx = (T(h)f_1, f_2) = 0 \tag{7.9.4} \]

for $(f_1, f_2, h) \in V_1 \times V_2 \times V_3$, if the trivial representation $1$ of $G$ is not contained in $\pi^{(1)} \otimes \pi^{(3)} \otimes \pi^{(2)*}$ (or if $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$ in case $\pi^{(2)}$ is irreducible), which provides the conclusion of Theorem 7.2. A proof of Theorem 7.6 is also given in Appendix 7A.

7.10 Group Invariance of Schrodinger’s Equation 

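For the Hamiltonian

\[ H \overset{\text{def}}{=} -\frac{\hbar^2}{2\mu} \nabla^2 - \frac{Ze^2}{\|(x, y, z)\|} \tag{7.10.1} \]

equation (5.1.27), which is the first equation in (5.1.9), can be written as

\[ H\psi = E\psi \tag{7.10.2} \]

which as we have seen is solved by the square-integrable hydrogenic wave functions $\psi_{nlm}$ for $E = E_n \overset{\text{def}}{=} -\mu Z^2 e^4 / 2\hbar^2 n^2$. Now since any $A \in O(3)$ preserves the norm $\| \ \|$, where $\|(x, y, z)\| = \sqrt{x^2 + y^2 + z^2}$, it follows from (7.1.10) that $A$ also commutes with $H$: For suitable $f : \mathbb{R}^3 \to \mathbb{C}$

\[
\begin{aligned}
[H(A \cdot f)](x, y, z) &= -\frac{\hbar^2}{2\mu} \big(\nabla^2 (A \cdot f)\big)(x, y, z) - Ze^2\, \|(x, y, z)\|^{-1} (A \cdot f)(x, y, z) \\
&= -\frac{\hbar^2}{2\mu} (A \cdot \nabla^2 f)(x, y, z) - Ze^2\, \|(x, y, z)\|^{-1} f((x, y, z)A) \\
&= -\frac{\hbar^2}{2\mu} (\nabla^2 f)((x, y, z)A) - Ze^2\, \|(x, y, z)\|^{-1} f((x, y, z)A),
\end{aligned}
\tag{7.10.3}
\]

whereas

\[ [A \cdot (Hf)](x, y, z) = (Hf)((x, y, z)A) = -\frac{\hbar^2}{2\mu} (\nabla^2 f)((x, y, z)A) - Ze^2\, \|(x, y, z)A\|^{-1} f((x, y, z)A), \]

and the two expressions agree because $\|(x, y, z)A\| = \|(x, y, z)\|$.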
Theorem 7.7. Let $H$ be the Hamiltonian in (7.10.1). Thus the hydrogenic wave functions $\psi_{nlm}$ of Chapter 5 are solutions of the Schrodinger equation $H\psi = E\psi$ for $E = E_n \overset{\text{def}}{=} -\mu Z^2 e^4 / 2\hbar^2 n^2$, $n = 1, 2, 3, \ldots$. Given equation (7.1.10) (whose proof will be given in Appendix 7C) one also has that each $A \in O(3)$ commutes with $H$:

\[ H(A \cdot \psi) = A \cdot (H\psi), \quad \text{where} \quad (A \cdot \psi)(x, y, z) \overset{\text{def}}{=} \psi((x, y, z)A) \ \text{ for } \psi : \mathbb{R}^3 \to \mathbb{C}. \tag{7.10.4} \]

In particular if $\psi$ is a solution of the Schrodinger equation $H\psi = E\psi$, so is $A \cdot \psi$ (for the same eigenvalue $E$), which is Wigner's 1927 observation.

Many applications of group theory to quantum mechanics are based on the
commutativity of the Hamiltonian of some system with the action of some symmetry
group $G$, as illustrated in Theorem 7.7 for $G = O(3)$. Thus Schrödinger's
equation for the system is invariant under $G$. In Chapter 10 we shall consider the
commutativity or invariance condition (7.10.4) for $A \in G = S_n$, the symmetric
group on $n$ letters, in connection with a system of identical particles. For that
discussion it will be useful to have a description of the one-dimensional
representations of $S_n$. This we take up in the next section.



7.11 The Representations $1^{\pm}$ of $S_n$ and the
Irreducible Representations of $S_3$

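Besides the symmetry group $O(3)$, another one of great importance in quantum
mechanics is the symmetric group $G = S_n$ of permutations of $n$ letters, say $X_n =
\{1, 2, \ldots, n\}$, as we shall note in Chapter 10. As usual, any $\sigma \in G$ ($\sigma$ being a
bijection of $X_n$) is written

$$\sigma = \begin{pmatrix} 1 & 2 & \cdots & n \\ \sigma(1) & \sigma(2) & \cdots & \sigma(n) \end{pmatrix}. \tag{7.11.1}$$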

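There is a natural representation $\pi$ of $G$ on $\mathbb{C}^n$. $\pi(\sigma) : \mathbb{C}^n \to \mathbb{C}^n$ is given by

$$\pi(\sigma)(v_1, \ldots, v_n) \overset{\text{def}}{=} (v_{\sigma(1)}, \ldots, v_{\sigma(n)}) \tag{7.11.2}$$

for $(v_1, \ldots, v_n) \in \mathbb{C}^n$. Here the product $\sigma_1\sigma_2$ in $G$ is read from left to right
(apply $\sigma_1$ first, then $\sigma_2$), so that $(\sigma_1\sigma_2)(j) = \sigma_2(\sigma_1(j))$. Indeed

$$\begin{aligned}
\pi(\sigma_1\sigma_2)(v_1, \ldots, v_n) &\overset{\text{def}}{=} (v_{(\sigma_1\sigma_2)(1)}, \ldots, v_{(\sigma_1\sigma_2)(n)}) = (v_{\sigma_2(\sigma_1(1))}, \ldots, v_{\sigma_2(\sigma_1(n))}), \\
\pi(\sigma_1)(\pi(\sigma_2)(v_1, \ldots, v_n)) &\overset{\text{def}}{=} \pi(\sigma_1)(v_{\sigma_2(1)}, \ldots, v_{\sigma_2(n)}) = (v_{\sigma_2(\sigma_1(1))}, \ldots, v_{\sigma_2(\sigma_1(n))}) \\
&\Longrightarrow \pi(\sigma_1\sigma_2) = \pi(\sigma_1)\pi(\sigma_2) \text{ for } \sigma_1, \sigma_2 \in G,
\end{aligned} \tag{7.11.3}$$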



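which shows that $\pi$ is a representation, which is automatically continuous since
the topology of a finite group is discrete. To find the matrix version of $\pi$, let
$\{e_j\}_{j=1}^n$ be the standard basis of $\mathbb{C}^n$: $e_j = (0, \ldots, 0, 1, 0, \ldots, 0)$ where the 1
appears in the $j$th place. Write $e_j = (e_j^{(1)}, \ldots, e_j^{(n)})$, i.e., $e_j^{(i)} = \delta_{ji}$. By (7.11.2)

$$\pi(\sigma)e_j = (e_j^{(\sigma(1))}, \ldots, e_j^{(\sigma(n))}) = \sum_{l=1}^n e_j^{(\sigma(l))} e_l = \sum_{l=1}^n \delta_{j\sigma(l)}\, e_l = e_{\sigma^{-1}(j)}. \tag{7.11.4}$$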

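In particular the matrix of $\pi(\sigma)$ is given by

$$\pi(\sigma) = \begin{pmatrix}
\delta_{\sigma(1)1} & \cdots & \delta_{\sigma(1)n} \\
\delta_{\sigma(2)1} & \cdots & \delta_{\sigma(2)n} \\
\vdots & & \vdots \\
\delta_{\sigma(n)1} & \cdots & \delta_{\sigma(n)n}
\end{pmatrix}, \quad \text{i.e., } \pi(\sigma)_{ij} = \delta_{\sigma(i)j}. \tag{7.11.5}$$

Note that

$$\sum_{k=1}^n \delta_{\sigma(i)k}\,\delta_{\sigma(j)k} = \delta_{ij}. \tag{7.11.6}$$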



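For if the left-hand side of (7.11.6) is not zero we must have $\delta_{\sigma(i)k}\,\delta_{\sigma(j)k} \neq 0$ for
some $k$ $\Longrightarrow$ $\sigma(i) = k = \sigma(j)$ $\Longrightarrow$ $i = j$. That is, $i \neq j$ $\Longrightarrow$ the left-hand side
of (7.11.6) is zero. If $i = j$ however, then the left-hand side of (7.11.6) is

$$\sum_{k=1}^n \delta_{\sigma(i)k}\,\delta_{\sigma(i)k} = \sum_{k=1}^n \delta_{\sigma(i)k} = 1, \tag{7.11.7}$$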



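which proves (7.11.6). But given (7.11.5), (7.11.6) is just the statement that

$$[\pi(\sigma)\pi(\sigma)^t]_{ij} = \sum_{k=1}^n \pi(\sigma)_{ik}\,\pi(\sigma)_{jk} = \delta_{ij}, \tag{7.11.8}$$

i.e., $\pi(\sigma)\pi(\sigma)^t = 1$ $\Longrightarrow$ each $\pi(\sigma) \in O(n)$. In particular $[\det \pi(\sigma)]^2 = 1$ $\Longrightarrow$

$$\det \pi(\sigma) = \pm 1 \text{ for each } \sigma \in S_n. \tag{7.11.9}$$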
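The statements (7.11.5) through (7.11.9) are easy to test by machine for small $n$. The following Python sketch is an illustration only and is not part of the original development: the helper names `perm_matrix`, `det` and `parity` are our own, permutations are stored 0-based, and $n = 4$ is an arbitrary choice. It builds $\pi(\sigma)$ from (7.11.5), checks $\pi(\sigma)\pi(\sigma)^t = 1$ for every $\sigma$, and compares $\det \pi(\sigma)$ with the parity of $\sigma$ obtained by counting inversions.

```python
from itertools import permutations


def perm_matrix(sigma):
    """Matrix of pi(sigma) as in (7.11.5): entry (i, j) equals delta_{sigma(i) j}.

    sigma is a tuple of 0-based images, i.e. sigma[i] stands for sigma(i + 1) - 1.
    """
    n = len(sigma)
    return [[1 if sigma[i] == j else 0 for j in range(n)] for i in range(n)]


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def det(m):
    """Determinant by cofactor expansion along the first row (adequate for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        if entry:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * entry * det(minor)
    return total


def parity(sigma):
    """+1 for an even permutation, -1 for an odd one, by counting inversions."""
    n = len(sigma)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n)
                     if sigma[i] > sigma[j])
    return -1 if inversions % 2 else 1


n = 4  # illustrative choice; any small n works
identity = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
group = list(permutations(range(n)))

# (7.11.8): pi(sigma) pi(sigma)^t = 1 for every sigma in S_n.
all_orthogonal = all(
    matmul(perm_matrix(s), transpose(perm_matrix(s))) == identity for s in group)
# (7.11.9), sharpened: det pi(sigma) is the parity of sigma.
dets_match_parity = all(det(perm_matrix(s)) == parity(s) for s in group)
print(all_orthogonal, dets_match_parity)
```

The cofactor expansion is chosen for transparency rather than speed; it is only meant for the small matrices considered here.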

It is easy to construct the one-dimensional trivial representation $1^+$ of $S_n$ as a
subrepresentation of $\pi$. Namely define

$$V_1 = \mathbb{C}(1, 1, \ldots, 1) = \mathbb{C}(e_1 + e_2 + \cdots + e_n), \tag{7.11.10}$$

which is a one-dimensional subspace of $\mathbb{C}^n$ on which $\pi$ acts trivially: By (7.11.4)

$$\pi(\sigma)(e_1 + e_2 + \cdots + e_n) = e_{\sigma^{-1}(1)} + e_{\sigma^{-1}(2)} + \cdots + e_{\sigma^{-1}(n)} = e_1 + e_2 + \cdots + e_n. \tag{7.11.11}$$

Thus $1^+ = \pi|_{V_1}$. Given $w = (w_1, w_2, \ldots, w_n) \in \mathbb{C}^n$ define

$$\lambda = \frac{1}{n}\sum_{j=1}^n w_j \in \mathbb{C}, \quad v_j = w_j - \lambda \in \mathbb{C}. \tag{7.11.12}$$

Then

$$\sum_{j=1}^n v_j = \sum_{j=1}^n w_j - n\lambda = 0 \Longrightarrow \tag{7.11.13}$$

$v \overset{\text{def}}{=} (v_1, \ldots, v_n)$ belongs to the $\pi$-invariant subspace

$$V_2 = \Big\{ v = (v_1, \ldots, v_n) \in \mathbb{C}^n \ \Big|\ \sum_{j=1}^n v_j = 0 \Big\} \tag{7.11.14}$$

of $\mathbb{C}^n$. Also $v + \lambda(1, 1, \ldots, 1) = (v_1 + \lambda, v_2 + \lambda, \ldots, v_n + \lambda) = (w_1, \ldots, w_n) = w$,
which shows that

$$\mathbb{C}^n = V_1 \oplus V_2, \quad \pi = 1^+ \oplus \pi_2 \ \text{ for } \pi_2 = \pi|_{V_2} \tag{7.11.15}$$

since clearly $V_1 \cap V_2 = \{0\}$: $v = (v_1, \ldots, v_n) = \lambda(1, 1, \ldots, 1)$ with $v \in V_2$ $\Longrightarrow$
$0 = \sum_j v_j = n\lambda$ $\Longrightarrow$ $\lambda = 0$ $\Longrightarrow$ $v = 0$. We also see in (7.11.15) that $\pi$ is not
irreducible of course.

The group $S_n$ has another one-dimensional (non-trivial) representation $1^-$ which
is constructed as follows. $1^-(\sigma) : \mathbb{C} \to \mathbb{C}$ is given by

$$1^-(\sigma)z \overset{\text{def}}{=} (\det \pi(\sigma))\,z \text{ for } z \in \mathbb{C}. \tag{7.11.16}$$

Since $\det \pi(\sigma) \neq 0$ ($\det \pi(\sigma) = \pm 1$ by (7.11.9)), $1^-(\sigma)$ is indeed a non-singular
linear operator on $\mathbb{C}$ such that for $\sigma_1, \sigma_2 \in S_n$, $1^-(\sigma_1\sigma_2)z = (\det \pi(\sigma_1\sigma_2))z =
(\det(\pi(\sigma_1)\pi(\sigma_2)))z = (\det \pi(\sigma_1) \det \pi(\sigma_2))z = 1^-(\sigma_1)\,1^-(\sigma_2)z$; i.e., $1^-$ is indeed
a representation of $S_n$. The following result is known.

Theorem 7.8. Up to equivalence $1^{\pm}$ are the only one-dimensional representations
of $S_n$.

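An important special case, which arises often enough in chemistry, is the case
$n = 3$. Here $S_3$ is isomorphic to $D_3$, where in general $D_n$ is the dihedral group of
order $2n$, the symmetry group of a regular $n$-gon. We use this case to illustrate
some of the preceding remarks. Write $S_3 = \{I, A, B, C, D, E\}$, where

$$\begin{aligned}
I &= \begin{pmatrix} 1 & 2 & 3 \\ 1 & 2 & 3 \end{pmatrix}, &
A &= \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix}, &
B &= \begin{pmatrix} 1 & 2 & 3 \\ 2 & 3 & 1 \end{pmatrix}, \\
C &= \begin{pmatrix} 1 & 2 & 3 \\ 1 & 3 & 2 \end{pmatrix}, &
D &= \begin{pmatrix} 1 & 2 & 3 \\ 3 & 2 & 1 \end{pmatrix}, &
E &= \begin{pmatrix} 1 & 2 & 3 \\ 2 & 1 & 3 \end{pmatrix};
\end{aligned} \tag{7.11.17}$$

cf. (7.11.1). The $\pi(\sigma)$ in (7.11.5) are given by

$$\begin{aligned}
\pi(A) &= \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}, &
\pi(B) &= \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}, &
\pi(C) &= \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}, \\
\pi(D) &= \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}, &
\pi(E) &= \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}.
\end{aligned} \tag{7.11.18}$$

Of course $\pi(I)$ is the identity matrix. A basis for the subspace $V_2$ in (7.11.14) is
$\{e_1 - e_3, e_2 - e_3\}$. Relative to this basis one has

$$\begin{aligned}
\pi_2(A) &= \begin{pmatrix} -1 & -1 \\ 1 & 0 \end{pmatrix}, &
\pi_2(B) &= \begin{pmatrix} 0 & 1 \\ -1 & -1 \end{pmatrix}, &
\pi_2(C) &= \begin{pmatrix} 1 & 0 \\ -1 & -1 \end{pmatrix}, \\
\pi_2(D) &= \begin{pmatrix} -1 & -1 \\ 0 & 1 \end{pmatrix}, &
\pi_2(E) &= \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.
\end{aligned} \tag{7.11.19}$$

In particular we obtain the following character values:

$$\begin{aligned}
\chi_{\pi_2}(I) &= 2, & \chi_{\pi_2}(A) &= -1, & \chi_{\pi_2}(B) &= -1, \\
\chi_{\pi_2}(C) &= 0, & \chi_{\pi_2}(D) &= 0, & \chi_{\pi_2}(E) &= 0.
\end{aligned} \tag{7.11.20}$$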



Now apply the Frobenius criterion: For a representation $\tau$ of a finite group $G$, $\tau$
is irreducible if and only if its character $\chi_\tau$ satisfies

$$\frac{1}{\text{order of } G} \sum_{g \in G} |\chi_\tau(g)|^2 = 1. \tag{7.11.21}$$

Given (7.11.20) we indeed have for $G = S_3$, $\tau = \pi_2$,

$$\frac{1}{6} \sum_{g \in S_3} |\chi_{\pi_2}(g)|^2 = \frac{1}{6}\,[4 + 1 + 1 + 0 + 0 + 0] = 1. \tag{7.11.22}$$



Thus $\pi_2$ is irreducible and in (7.11.15) we have the complete reduction of $\pi =
1^+ \oplus \pi_2$ as a direct sum of irreducible representations.
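The matrices (7.11.19), the character values (7.11.20) and the Frobenius sum (7.11.22) can be recomputed mechanically. The sketch below is only an illustration under our own conventions: permutations of $\{1, 2, 3\}$ are stored 0-based as tuples, `pi2_matrix` is a hypothetical helper name, and the action $\pi(\sigma)e_j = e_{\sigma^{-1}(j)}$ of (7.11.4) is used on the basis $\{e_1 - e_3, e_2 - e_3\}$ of $V_2$. For comparison it also evaluates the same sum for $\pi$ itself, whose character counts fixed points.

```python
from fractions import Fraction
from itertools import permutations


def pi2_matrix(sigma):
    """Matrix of pi(sigma) restricted to V_2 in the basis {e_1 - e_3, e_2 - e_3}.

    sigma is a 0-based tuple of images; pi(sigma) e_j = e_{sigma^{-1}(j)} by (7.11.4).
    """
    inverse = [0, 0, 0]
    for i, s in enumerate(sigma):
        inverse[s] = i
    columns = []
    for j in (0, 1):
        # Image of e_j - e_3 as a vector of C^3 (index 2 is e_3).
        image = [0, 0, 0]
        image[inverse[j]] += 1
        image[inverse[2]] -= 1
        # image = a (e_1 - e_3) + b (e_2 - e_3) = (a, b, -a - b).
        columns.append((image[0], image[1]))
    return [[columns[0][0], columns[1][0]],
            [columns[0][1], columns[1][1]]]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


group = list(permutations(range(3)))

# Character of pi_2, as in (7.11.20).
chars_pi2 = {s: trace(pi2_matrix(s)) for s in group}
# Character of pi: the trace of a permutation matrix counts fixed points.
chars_pi = {s: sum(1 for i in range(3) if s[i] == i) for s in group}

frobenius_pi2 = Fraction(sum(c * c for c in chars_pi2.values()), len(group))
frobenius_pi = Fraction(sum(c * c for c in chars_pi.values()), len(group))
print(frobenius_pi2, frobenius_pi)
```

The value 1 for $\pi_2$ reproduces (7.11.22), while the value 2 for $\pi$ reflects that $\pi$ is reducible, with the two inequivalent irreducible summands of (7.11.15).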

Note that $1^-$ in (7.11.16) is given as follows, where $I_{\mathbb{C}}$ is the identity operator
on $\mathbb{C}$:

$$1^-(A) = 1^-(B) = 1^-(I) = I_{\mathbb{C}}, \quad 1^-(C) = 1^-(D) = 1^-(E) = -I_{\mathbb{C}}, \tag{7.11.23}$$

since

$$\begin{aligned}
\det \pi(\sigma) &= \det \pi_2(\sigma) = 1 \ \text{ for } \sigma = A, B, I, \\
\det \pi(\sigma) &= \det \pi_2(\sigma) = -1 \ \text{ for } \sigma = C, D, E
\end{aligned} \tag{7.11.24}$$

by (7.11.18), (7.11.19). In other words $1^-(\sigma) = \pm I_{\mathbb{C}}$ according as $\sigma$ is an even
(+) or odd (-) permutation; the same observation is in fact true for $S_n$ in general.
$C$ for example is an odd permutation as it is the product of $r$ transpositions with
$r = 1$; namely $C$ is the transposition $2 \to 3$, $3 \to 2$ with 1 being fixed.

Finally one can deduce Theorem 7.8 in the present case via the completeness
relation for a finite group $G$: If $\tau_1, \ldots, \tau_m$ is a list of its irreducible representations,
up to equivalence, then

$$\sum_{j=1}^m [\text{dimension } \tau_j]^2 = \text{the order of } G. \tag{7.11.25}$$

Since $1^+$, $1^-$, $\pi_2$ are irreducible and pairwise inequivalent, of dimensions 1, 1, 2,
and since $1^2 + 1^2 + 2^2 = 6 =$ the order of $S_3$, equation (7.11.25) leaves no room for
a further irreducible representation. We have in fact constructed all of the
irreducible representations of $S_3$, up to equivalence, and have verified
Theorem 7.8 in particular for $n = 3$.

Extended and systematic applications of group theory, more than that which we 
have attempted to undertake here, can be found in the references [79, 83, 87], for 
example. 

Appendix 7A Proof of Theorems 7.2 and 7.6 

Given the formula (7.5.17) we show how Theorem 7.2 follows from a more gen- 
eral result presented here. First we recall the notion of a sesquilinear form. 

Let $V_1$, $V_2$ be complex vector spaces and let $B : V_1 \times V_2 \to \mathbb{C}$ be a function
which satisfies $B(x, y_1 + y_2) = B(x, y_1) + B(x, y_2)$, $B(x_1 + x_2, y) = B(x_1, y) +
B(x_2, y)$, $B(cx, y) = cB(x, y)$, $B(x, cy) = \bar{c}B(x, y)$ for $x, x_1, x_2 \in V_1$, $y, y_1, y_2 \in
V_2$, $c \in \mathbb{C}$, where $\bar{c}$ is the complex conjugate of $c$. Then $B$ is called a sesquilinear
form on $V_1 \times V_2$. If $A : V_1 \to V_2$ is any linear map and $\langle\ ,\ \rangle$ is a complex inner
product on $V_2$ then there is a corresponding sesquilinear form $B = B_{A, \langle\, ,\, \rangle}$ on
$V_1 \times V_2$ given by $B(x, y) \overset{\text{def}}{=} \langle Ax, y \rangle$ for $(x, y) \in V_1 \times V_2$. Conversely one has

Proposition 7.3. Let $B : V_1 \times V_2 \to \mathbb{C}$ be a sesquilinear form where $(V_2, \langle\ ,\ \rangle)$
is a finite-dimensional inner product space. Then there is a unique linear map
$A : V_1 \to V_2$ such that $B(x, y) = \langle Ax, y \rangle$ $\forall (x, y) \in V_1 \times V_2$.

Proof. Given $x \in V_1$, define $L_x : V_2 \to \mathbb{C}$ by $L_x(y) = \overline{B(x, y)}$ for $y \in V_2$. As $B$
is sesquilinear, $L_x \in V_2^*$. Hence there exists a unique element $x^* \in V_2$ such that

$$L_x(y) = \langle y, x^* \rangle \quad \forall y \in V_2; \tag{7.A.1}$$

i.e., $\langle x^*, y \rangle = \overline{L_x(y)} = B(x, y)$ $\forall y \in V_2$. Define $A : V_1 \to V_2$ by $Ax \overset{\text{def}}{=} x^*$.
Then for $x_1, x_2 \in V_1$, $y \in V_2$, $\langle (x_1 + x_2)^*, y \rangle = B(x_1 + x_2, y) = B(x_1, y) +
B(x_2, y) = \langle x_1^*, y \rangle + \langle x_2^*, y \rangle = \langle x_1^* + x_2^*, y \rangle$ $\Longrightarrow$ $(x_1 + x_2)^* = x_1^* + x_2^*$. Similarly
$(cx_1)^* = cx_1^*$ for $c \in \mathbb{C}$ $\Longrightarrow$ $A$ is linear. $A$ is clearly uniquely determined by $B$.
Thus Proposition 7.3 is established. □







Note that if $(V_2, \langle\ ,\ \rangle)$ is an infinite-dimensional Hilbert space (a case which we
shall not need) and $B$, $A$ are required to be bounded (i.e., $|B(x, y)| \le M_1 \|x\|\, \|y\|$,
$\|Ax\| \le M_2 \|x\|$ for some $M_1, M_2 > 0$ and some norm $\|\ \|$ also on $V_1$), then
Proposition 7.3 remains valid since equation (7.A.1) would follow by the theorem
of Riesz. In fact we would have $\|x^*\| = \|L_x\|$ and consequently $\|A\| = \|B\|$.

Now let $\pi_1$, $\pi_2$ be finite-dimensional continuous representations of $G$ on complex
vector spaces $W$, $V_2$, say that $\pi_2$ is unitary with respect to the inner product
$\langle\ ,\ \rangle$ on $V_2$. Here we take $G$ to be a compact topological group (with a Hausdorff
topology), and we denote normalized Haar measure on $G$ by $dg$. For fixed
$(\omega, L) \in W^* \times V_2^*$ there is a natural sesquilinear form $B$ on $W \times V_2$ given by

$$B(w, y) = B_{\omega, L}(w, y) = \int_G \omega(\pi_1(g)w)\, \overline{L(\pi_2(g)y)}\, dg. \tag{7.A.2}$$



Since $\pi_1$, $\pi_2$ are continuous the integrand in (7.A.2) is continuous; cf. the definition
at the close of Section 7.1. By Proposition 7.3 there is a unique linear map
$A = A_{\omega, L} : W \to V_2$ such that $B(w, y) = \langle Aw, y \rangle$ on $W \times V_2$. For an arbitrary
element $g_1 \in G$ and $(w, y) \in W \times V_2$

$$\begin{aligned}
B(\pi_1(g_1)w, \pi_2(g_1)y) &= \int_G \omega(\pi_1(gg_1)w)\, \overline{L(\pi_2(gg_1)y)}\, dg
&& (\text{since } \pi_j(g)\pi_j(g_1) = \pi_j(gg_1) \text{ for } j = 1, 2) \\
&= \int_G \omega(\pi_1(g)w)\, \overline{L(\pi_2(g)y)}\, dg
&& (\text{by the } G\text{-invariance of } dg) \\
&= B(w, y).
\end{aligned} \tag{7.A.3}$$



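That is, $\langle A\pi_1(g)w, \pi_2(g)y \rangle = \langle Aw, y \rangle = \langle \pi_2(g)Aw, \pi_2(g)y \rangle$ (since $\pi_2$ is unitary)
$\forall (w, y)$ $\Longrightarrow$

$$A\pi_1(g) = \pi_2(g)A: \tag{7.A.4}$$

The diagram

$$\begin{array}{ccc}
W & \xrightarrow{\ \pi_1(g)\ } & W \\
{\scriptstyle A}\downarrow & & \downarrow{\scriptstyle A} \\
V_2 & \xrightarrow{\ \pi_2(g)\ } & V_2
\end{array}$$

is commutative $\forall g \in G$; compare examples 2, 4, 5 in Section 7.1. That is, $A :
W \to V_2$ is a $G$-map: $A \in \operatorname{Hom}_G(W, V_2) \overset{\text{def}}{=}$ the space of linear maps $A : W \to
V_2$ such that (7.A.4) holds. Now we use the following standard fact from
finite-dimensional representation theory, which amounts to the classical orthogonality
relations for a compact group:

$$\dim \operatorname{Hom}_G(W, V_2) = \int_G \chi_{\pi_1}(g)\, \overline{\chi_{\pi_2}(g)}\, dg = \int_G \chi_{\pi_2}(g)\, \overline{\chi_{\pi_1}(g)}\, dg = \dim \operatorname{Hom}_G(V_2, W), \tag{7.A.5}$$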

where as before $\chi_\pi$ denotes the character of the representation $\pi$. In particular if
$A \neq 0$ then $\operatorname{Hom}_G(V_2, W) \neq \{0\}$: $\exists\, \phi : V_2 \to W$ a non-zero, linear $G$-map. The
kernel of $\phi$ is $\pi_2$-invariant (since $\phi$ is a $G$-map) and differs from $V_2$ (since $\phi \neq 0$).
Hence if $\pi_2$ is irreducible we must have the kernel of $\phi = \{0\}$; i.e., $\phi$ is one-to-one
and thus, by way of $\phi$, $\pi_2$ is equivalent to the subrepresentation $\pi_1|_{\phi(V_2)}$ of $\pi_1$,
where the range $\phi(V_2)$ is indeed $\pi_1$-invariant by the $G$-invariance of $\phi$; i.e., (as
defined in Section 7.3),

$$\pi_2 \text{ is contained in } \pi_1 \text{ if } \operatorname{Hom}_G(V_2, W) \neq \{0\} \text{ and } \pi_2 \text{ is irreducible.} \tag{7.A.6}$$



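In other words if $\pi_2$ is irreducible and $\pi_2$ is not contained in $\pi_1$, then we must
have $A = 0$ $\Longrightarrow$ $B = 0$ $\Longrightarrow$ (by definition (7.A.2))

$$\int_G \omega(\pi_1(g)w)\, \overline{L(\pi_2(g)y)}\, dg = 0 \tag{7.A.7}$$

for all $(w, y) \in W \times V_2$, $(\omega, L) \in W^* \times V_2^*$. If $\pi_2$ is not necessarily irreducible
we use (7.A.5) to write

$$\begin{aligned}
\dim \operatorname{Hom}_G(W, V_2) &= \int_G \chi_{\pi_1}(g)\, \overline{\chi_{\pi_2}(g)}\, dg \\
&= \int_G \chi_{\pi_1}(g)\, \chi_{\pi_2^*}(g)\, dg = \int_G \chi_{\pi_1 \otimes \pi_2^*}(g)\, dg \\
&= \int_G \chi_{\pi_1 \otimes \pi_2^*}(g)\, \overline{\chi_1(g)}\, dg \\
&= \dim \operatorname{Hom}_G(\mathbb{C}, W \otimes V_2^*)
\end{aligned} \tag{7.A.8}$$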



by standard elementary properties of characters, where $1$ is the trivial representation
of $G$ on $\mathbb{C}$. As $1$ is irreducible, statement (7.A.6) applies: If $1$ is not contained
in $\pi_1 \otimes \pi_2^*$, then $\operatorname{Hom}_G(W, V_2) = \{0\}$ by (7.A.8) and since $A \in \operatorname{Hom}_G(W, V_2)$
we must have $A = 0$ $\Longrightarrow$ $B = 0$ so that again we arrive at (7.A.7), which proves

Theorem 7.9. Let $\pi_1$, $\pi_2$ be finite-dimensional, continuous representations of $G$
on complex vector spaces $W$, $V_2$ with $\pi_2^*$ equal to the contragredient representation
of $\pi_2$ acting on $V_2^*$. If $\pi_1 \otimes \pi_2^*$ does not contain the one-dimensional trivial
representation of $G$ on $\mathbb{C}$ then for all $(w, y) \in W \times V_2$, $(\omega, L) \in W^* \times V_2^*$

$$\int_G \omega(\pi_1(g)w)\, \overline{L(\pi_2(g)y)}\, dg = 0.$$

This equation also holds under the assumption that $\pi_2$ is irreducible and is not
contained in $\pi_1$.

We have established Theorem 7.9 strictly speaking only for $\pi_2$ unitary. But as
is well known this is essentially a non-assumption. Namely if $\langle\ ,\ \rangle_0$ is any inner
product whatsoever on $V_2$, then the equation

$$\langle v_2, v_2' \rangle = \int_G \langle \pi_2(g)v_2, \pi_2(g)v_2' \rangle_0\, dg \tag{7.A.9}$$

defines an inner product $\langle\ ,\ \rangle$ on $V_2$ for which $\pi_2$ is unitary.
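For a finite group the normalized Haar integral in (7.A.9) is simply the average over the group, so the averaging construction can be carried out by hand or by machine. The following Python sketch is an illustration under stated assumptions: it takes $G = S_3$ and the matrices $\pi_2(\sigma)$ of (7.11.19), uses the ordinary dot product of coordinates as the starting inner product $\langle\ ,\ \rangle_0$, and the names `PI2` and `averaged_gram` are our own. It computes the Gram matrix $M$ of the averaged form, $\langle u, v \rangle = u^t M \bar{v}$, and checks the invariance $\pi_2(\sigma)^t M\, \pi_2(\sigma) = M$.

```python
from fractions import Fraction

# The matrices pi_2(sigma), sigma in S_3, relative to the basis {e_1 - e_3, e_2 - e_3}.
PI2 = {
    "I": [[1, 0], [0, 1]],
    "A": [[-1, -1], [1, 0]],
    "B": [[0, 1], [-1, -1]],
    "C": [[1, 0], [-1, -1]],
    "D": [[-1, -1], [0, 1]],
    "E": [[0, 1], [1, 0]],
}


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def averaged_gram(mats):
    """Gram matrix of <u, v> = (1/|G|) sum_g <P_g u, P_g v>_0 for real matrices P_g.

    Here <,>_0 is the dot product, so each summand has Gram matrix P_g^t P_g.
    """
    n = len(mats[0])
    total = [[Fraction(0)] * n for _ in range(n)]
    for p in mats:
        ptp = matmul(transpose(p), p)
        total = [[total[i][j] + ptp[i][j] for j in range(n)] for i in range(n)]
    return [[entry / len(mats) for entry in row] for row in total]


M = averaged_gram(list(PI2.values()))
identity = [[1, 0], [0, 1]]

# With respect to <,>_0 the matrices are not all orthogonal ...
unitary_before = all(matmul(transpose(p), p) == identity for p in PI2.values())
# ... but every pi_2(sigma) preserves the averaged form.
unitary_after = all(
    matmul(matmul(transpose(p), M), p) == M for p in PI2.values())
print(M, unitary_before, unitary_after)
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the invariance test is an equality rather than a floating-point comparison.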

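Theorem 7.2 follows from Theorem 7.9. Namely, assume that $\pi^{(1)} \otimes \pi^{(3)} \otimes \pi^{(2)*}$
does not contain the trivial one-dimensional representation of $G$; or if $\pi^{(2)}$ is
actually irreducible, assume that $\pi^{(2)}$ is not contained in $\pi^{(1)} \otimes \pi^{(3)}$. In Theorem 7.9
choose $\pi_1 = \pi^{(1)} \otimes \pi^{(3)}$ and $\pi_2 = \pi^{(2)}$. Thus we may conclude that

$$\int_G \omega\big((\pi^{(1)} \otimes \pi^{(3)})(g)w\big)\, \overline{L(\pi^{(2)}(g)y)}\, dg = 0 \tag{7.A.10}$$

for all $(w, y) \in (V_1 \otimes V_3) \times V_2$, $(\omega, L) \in (V_1 \otimes V_3)^* \times V_2^*$. In particular by (7.5.17)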
we conclude that itji = 0 Vt, 7 , /, which proves Theorem 7.2. 

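For the proof of Theorem 7.6 we make use of the notation established in Section 7.9.
Thus let $T : V_3 \to \mathcal{B}(H)$ be a tensor operator relative to $\pi$, $\pi^{(3)}$, and let
$V_1, V_2 \subset H$ be finite-dimensional $\pi$-invariant subspaces with $\pi^{(1)}$, $\pi^{(2)}$ denoting
the corresponding subrepresentations. As $T : V_3 \to \mathcal{B}(H)$ is linear and satisfies
(7.9.1), it is immediate that the equation

$$b(v_3, v_1) \overset{\text{def}}{=} T(v_3)v_1, \quad (v_3, v_1) \in V_3 \times V_1 \tag{7.A.11}$$

defines a bilinear form $b = b_T : V_3 \times V_1 \to H$, which satisfies the $G$-invariance
condition

$$b(\pi^{(3)}(g)v_3, \pi(g)v_1) = \pi(g)\,b(v_3, v_1). \tag{7.A.12}$$

By the definition of $V_3 \otimes V_1$ given in Section 7.2 (the tensor product being taken
over $\mathbb{C}$) there exists a unique linear map $\tilde{b} = \tilde{b}_T : V_3 \otimes V_1 \to H$ such that

$$\tilde{b}(v_3 \otimes v_1) = b(v_3, v_1) = T(v_3)v_1 \ \text{ for } (v_3, v_1) \in V_3 \times V_1. \tag{7.A.13}$$

For the tensor product representation $\pi^{(3)} \otimes \pi^{(1)}$ one has by (7.A.12)

$$\begin{aligned}
\tilde{b}\big((\pi^{(3)} \otimes \pi^{(1)})(g)(v_3 \otimes v_1)\big) &\overset{\text{def}}{=} \tilde{b}\big(\pi^{(3)}(g)v_3 \otimes \pi^{(1)}(g)v_1\big) = b(\pi^{(3)}(g)v_3, \pi(g)v_1) \\
&= \pi(g)\,b(v_3, v_1) = \pi(g)\,\tilde{b}(v_3 \otimes v_1),
\end{aligned} \tag{7.A.14}$$

which shows that $\tilde{b}$ intertwines $\pi^{(3)} \otimes \pi^{(1)}$ and $\pi$. That is,

$$\tilde{b}\big((\pi^{(3)} \otimes \pi^{(1)})(g)\,t\big) = \pi(g)\,\tilde{b}(t) \ \text{ for } (g, t) \in G \times (V_3 \otimes V_1). \tag{7.A.15}$$

Next define $B = B_T : (V_3 \otimes V_1) \times V_2 \to \mathbb{C}$ by

$$B(t, v_2) = \langle \tilde{b}(t), v_2 \rangle \ \text{ for } (t, v_2) \in (V_3 \otimes V_1) \times V_2. \tag{7.A.16}$$

As $\tilde{b}$ is linear and the inner product $\langle\ ,\ \rangle$ on $H$ is sesquilinear it follows that $B$ is
a sesquilinear form, and therefore by Proposition 7.3 there is a unique linear map
$A = A_T : V_3 \otimes V_1 \to V_2$ such that

$$B(t, v_2) = \langle At, v_2 \rangle_2 \ \text{ for } (t, v_2) \in (V_3 \otimes V_1) \times V_2, \tag{7.A.17}$$

where $\langle\ ,\ \rangle_2$ is the inner product $\langle\ ,\ \rangle$ restricted to $V_2$. Now for $g \in G$,
$A_g \overset{\text{def}}{=} \pi^{(2)}(g^{-1})\,A\,(\pi^{(3)} \otimes \pi^{(1)})(g) : V_3 \otimes V_1 \to V_2$ is a linear map, and since $\pi$ is unitary $A_g$
also satisfies

$$\begin{aligned}
\langle A_g t, v_2 \rangle_2 &= \langle A(\pi^{(3)} \otimes \pi^{(1)})(g)t,\ \pi^{(2)}(g)v_2 \rangle_2 \\
&= B\big((\pi^{(3)} \otimes \pi^{(1)})(g)t,\ \pi^{(2)}(g)v_2\big) && (\text{def. (7.A.17)}) \\
&= \langle \tilde{b}((\pi^{(3)} \otimes \pi^{(1)})(g)t),\ \pi^{(2)}(g)v_2 \rangle && (\text{def. (7.A.16)}) \\
&= \langle \pi(g)\tilde{b}(t),\ \pi(g)v_2 \rangle && (\text{by (7.A.15)}) \\
&= \langle \tilde{b}(t), v_2 \rangle = B(t, v_2) && (\text{def. (7.A.16)})
\end{aligned} \tag{7.A.18}$$

for every $(t, v_2) \in (V_3 \otimes V_1) \times V_2$.

By the uniqueness of $A$ we must therefore have $A_g = A$: $A(\pi^{(3)} \otimes \pi^{(1)})(g) =
\pi^{(2)}(g)A$: The diagram

$$\begin{array}{ccc}
V_3 \otimes V_1 & \xrightarrow{\ (\pi^{(3)} \otimes \pi^{(1)})(g)\ } & V_3 \otimes V_1 \\
{\scriptstyle A}\downarrow & & \downarrow{\scriptstyle A} \\
V_2 & \xrightarrow{\ \pi^{(2)}(g)\ } & V_2
\end{array} \tag{7.A.19}$$

is commutative: $A \in \operatorname{Hom}_G(V_3 \otimes V_1, V_2)$. This puts us in position to apply
exactly the arguments that followed (7.A.4), where $\pi_1$ there is taken as $\pi^{(3)} \otimes \pi^{(1)}$
here. We can therefore conclude the following. If the trivial one-dimensional
representation $1$ of $G$ is not contained in $\pi^{(3)} \otimes \pi^{(1)} \otimes \pi^{(2)*}$, or if $\pi^{(2)}$ is actually
irreducible and is not contained in $\pi^{(3)} \otimes \pi^{(1)}$, then $A = 0$ and hence (by (7.A.17))
$B = 0$ $\Longrightarrow$ $\langle \tilde{b}(t), v_2 \rangle = 0$ for $(t, v_2) \in (V_3 \otimes V_1) \times V_2$ (by (7.A.16)). In particular
$\langle T(v_3)v_1, v_2 \rangle = 0$ for $(v_1, v_2, v_3) \in V_1 \times V_2 \times V_3$ by (7.A.13), which proves
Theorem 7.6 since of course the representations $\pi^{(1)} \otimes \pi^{(3)}$, $\pi^{(3)} \otimes \pi^{(1)}$ are equivalent
(via the isomorphism $V_1 \otimes V_3 \cong V_3 \otimes V_1$).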
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Appendix 7B Proof of Proposition 7.2 

Here we establish the proof of Proposition 7.2. First take / = 1: 
Ai(t) = 



m 



def I „ . . 

= ~ r^iiCO + 

n 






i;H)l 



Hn(r) dr Hyiit) 






For / # 1 



n 



i 



( ^). 



+ dr + 1^ 



(= termy 



In particular by (7.B.1) and (7.B.2) 

I >4 /.M ^ l^llWI r I tr / M J 

|Ai(f)| < — ^ l^fiiWI dr 

^ y f \H^j{r)\ dr \Hjdt)\ 

^ ;7?J« 



+ 



= ^2l«y.(')l 



\mj{r)\ dr. 



(7.B.1) 



= 1 ) 



(7.B.2) 



(7.B.3) 
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and for / 7^ 1 



That is, 



f' 

1 y \Hji(t)\ \ \H^j(r)\ dr 
^ 1 Jo 



\Mt)\ < |ffi,(r) dr 



+ ^\Hu(t)\^JHnir)\dr 

= ^'Z\^jiit)\^JHij{r)dr. 

\Mt)\<^'^\Hj,(t)\^JHij(r)\dr 



for every I, as desired. 



(7.B.4) 



(7.B.5) 

□ 



Appendix 7C Proof of Equation (7.1.10)

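For a fixed $n \times n$ real matrix $g = [g_{ij}]$ define $\psi = \psi_g : \mathbb{R}^n \to \mathbb{R}^n$ by $\psi =
(\psi_1, \ldots, \psi_n)$ where $\psi_j : \mathbb{R}^n \to \mathbb{R}$ is given by

$$\psi_j(x) = \sum_{i=1}^n g_{ji}\, x_i \ \text{ for } x = (x_1, \ldots, x_n) \in \mathbb{R}^n. \tag{7.C.1}$$

Let $f : \mathbb{R}^n \to \mathbb{R}$ or $\mathbb{C}$ have continuous first and second partial derivatives: $f$ is a
$C^2$-function. Define $F : \mathbb{R}^n \to \mathbb{R}$ or $\mathbb{C}$ by $F = f \circ \psi$: $F(x) = f(\psi_1(x), \ldots, \psi_n(x))$
for $x \in \mathbb{R}^n$. If $D_i$ denotes partial differentiation with respect to the $i$th variable,
then as $f$ is of class $C^1$ the chain rule gives

$$(D_iF)(x) = \sum_{j=1}^n (D_jf)(\psi(x))\,(D_i\psi_j)(x) = \sum_{j=1}^n g_{ji}\,(D_jf)(\psi(x)). \tag{7.C.2}$$

That is,

$$D_i(f \circ \psi) = \sum_{j=1}^n g_{ji}\,(D_jf) \circ \psi. \tag{7.C.3}$$

As $f$ is of class $C^2$ we may replace $f$ by $D_jf$ here:

$$D_i[(D_jf) \circ \psi] = \sum_{k=1}^n g_{ki}\,(D_kD_jf) \circ \psi. \tag{7.C.4}$$

Then for $D_i^2 \overset{\text{def}}{=} D_iD_i$,

$$D_i^2(f \circ \psi) = \sum_{j=1}^n g_{ji}\, D_i[(D_jf) \circ \psi] \tag{7.C.5}$$

$$= \sum_{j=1}^n \sum_{k=1}^n g_{ji}\, g_{ki}\,(D_kD_jf) \circ \psi, \tag{7.C.6}$$

so that

$$\begin{aligned}
\nabla^2(f \circ \psi) = \sum_{i=1}^n D_i^2(f \circ \psi) &= \sum_{i=1}^n \sum_{j=1}^n \sum_{k=1}^n g_{ji}\, g_{ki}\,(D_kD_jf) \circ \psi \\
&= \sum_{j=1}^n \sum_{k=1}^n \sum_{i=1}^n g_{ji}\,(g^t)_{ik}\,(D_kD_jf) \circ \psi \\
&= \sum_{j=1}^n \sum_{k=1}^n (gg^t)_{jk}\,(D_kD_jf) \circ \psi.
\end{aligned} \tag{7.C.7}$$

In particular if $g \in O(n)$ (i.e., $gg^t = 1$)

$$\nabla^2(f \circ \psi_g) = \sum_{j=1}^n \sum_{k=1}^n \delta_{jk}\,(D_kD_jf) \circ \psi_g = \sum_{j=1}^n (D_j^2f) \circ \psi_g = \Big(\sum_{j=1}^n D_j^2f\Big) \circ \psi_g = (\nabla^2f) \circ \psi_g. \tag{7.C.8}$$

However $f \circ \psi_g = g^{-1} \cdot f$ by the definition given in example 5, compared with
definition (7.C.1). (7.C.8) therefore reads $\nabla^2(g^{-1} \cdot f) = g^{-1} \cdot \nabla^2f$ for $g \in O(n)$,
which proves (7.1.10).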
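A numerical spot check of (7.C.7) and (7.C.8) may be reassuring. The Python sketch below is an illustration only, under our own assumptions: the test function `f`, the evaluation point `p`, the rotation angles and the step size of the central-difference Laplacian are arbitrary choices, and `psi` implements (7.C.1). For an orthogonal $g$ the two sides of (7.C.8) should agree up to discretization error, while for a shear (which is not orthogonal) they should not, in line with the factor $(gg^t)_{jk}$ in (7.C.7).

```python
import math


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def psi(g, x):
    """psi_g(x) of (7.C.1): the j-th component is sum_i g[j][i] * x[i]."""
    return [sum(g[j][i] * x[i] for i in range(len(x))) for j in range(len(g))]


def laplacian(func, p, h=1e-3):
    """Central-difference approximation of the Laplacian of func at the point p."""
    total = 0.0
    for i in range(len(p)):
        plus, minus = list(p), list(p)
        plus[i] += h
        minus[i] -= h
        total += (func(plus) - 2.0 * func(p) + func(minus)) / (h * h)
    return total


def f(x):
    # An arbitrary smooth test function that is neither radial nor harmonic.
    return x[0] ** 3 * x[1] + x[1] ** 2 * x[2] ** 2 + math.exp(x[2]) * math.sin(x[0])


def rot_z(t):
    return [[math.cos(t), -math.sin(t), 0.0],
            [math.sin(t), math.cos(t), 0.0],
            [0.0, 0.0, 1.0]]


def rot_x(t):
    return [[1.0, 0.0, 0.0],
            [0.0, math.cos(t), -math.sin(t)],
            [0.0, math.sin(t), math.cos(t)]]


g = matmul(rot_z(0.7), rot_x(-1.1))        # an element of O(3)
shear = [[1.0, 1.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # not orthogonal
p = [0.3, -0.5, 0.8]


def defect(mat):
    """|Laplacian(f o psi_mat)(p) - (Laplacian f)(psi_mat(p))|, numerically."""
    lhs = laplacian(lambda x: f(psi(mat, x)), p)
    rhs = laplacian(f, psi(mat, p))
    return abs(lhs - rhs)


defect_orthogonal = defect(g)
defect_shear = defect(shear)
print(defect_orthogonal, defect_shear)
```

The finite-difference Laplacian is only second-order accurate, so the orthogonal defect is small but not exactly zero.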



Appendix 7D Remarks on Equation (7.7.54) 

A reason for equation (7.7.54) is roughly the following. If $E$ were a static field
(which is manifestly not the case, as it is time-dependent) the energy $V$ of a
particle of charge $q$ under its influence would be given by

$$V(p) = -q \int_0^p E \cdot dr \tag{7.D.1}$$

at a point $p \in \mathbb{R}^3$. Compare equation (2.2.8) and Appendix 8A, equation (8.A.5).
As $E$ is directed along the $z$-axis, $V$ assumes the specialized form

$$V(0, 0, z) = -q \int_0^z E(0, 0, z', t)\, dz' \approx -qz\,E(x, y, z, t) \tag{7.D.2}$$

since by (7.7.53) $E(x, y, z, t)$ is independent of $(x, y, z)$, up to an approximation.
For a hydrogenic atom, the case of main interest to us, we choose $q = -e$ to obtain

$$V(0, 0, z) \approx ez\,E(x, y, z, t). \tag{7.D.3}$$

Thus we obtain the right-hand side of (7.7.54), by admittedly rough considerations.
The reader may wish to consult an appropriate physics text for further
discussions of equation (7.7.54).
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The Quantized Hamiltonian 
for a Charged Particle in an 
Electromagnetic Field 



8.1 The Motion of a Charged Particle in an
Electromagnetic Field

Electric fields and magnetic fields exert a manifest influence, or effect, on the 
energy levels of an atom. These effects, revealed for example by the splitting of 
spectral lines, are called Stark and Zeeman effects, respectively. The Stark effect 
(first observed by Johannes Stark in 1913) plays an important role, for example, in 
theoretical attempts to understand the formation of molecules from atoms. Our in- 
terest will be in the Zeeman effect — especially as we consider the notion of “spin,” 
and spin wave functions in the next chapter, particularly in Section 9.3 there. 

To prepare for that section we shall need to understand how to quantize the 
classical Hamiltonian that governs the motion of a charged particle in an electro- 
magnetic field. The ideas and results here of course are of independent interest. 
We draw upon the resources of Chapters 2 and 3. Our basic goal (to be realized 
in the next chapter) is to compute energy levels (Landau spectra), and spin wave
functions of hydrogenic atoms under the influence of a uniform magnetic field. 
Also we compute the Landau spectrum (or Zeeman effect) in another simpler 
example. 

We shall employ the Gaussian system of units mentioned in Chapter 0: an elec- 
trostatic unit of charge (1 esu or 1 stat-Coulomb) is that amount of charge on a 
test object for which an equal object of like charge placed 1 cm away is repelled 
by a force of 1 dyne, both objects considered placed in a vacuum. 

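To begin the discussion, consider the (non-relativistic) motion of a particle with
a charge $q$ in an electromagnetic field $(E, B)$. That is, $E$ and $B$ are functions
$\mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ depending on position $(x, y, z)$ and time $t$, which represent the
electric and magnetic components (respectively) of the field, and which in practical
situations will be given by some scalar and vector potential $\phi$, $A = (A_1, A_2, A_3)$ :
$\mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}, \mathbb{R}^3$:

$$\begin{aligned}
E &= \left( -\frac{\partial \phi}{\partial x} - \frac{1}{c}\frac{\partial A_1}{\partial t},\ -\frac{\partial \phi}{\partial y} - \frac{1}{c}\frac{\partial A_2}{\partial t},\ -\frac{\partial \phi}{\partial z} - \frac{1}{c}\frac{\partial A_3}{\partial t} \right), \\
B &= \left( \frac{\partial A_3}{\partial y} - \frac{\partial A_2}{\partial z},\ \frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x},\ \frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y} \right)
\end{aligned} \tag{8.1.1}$$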



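for $c$ = the speed of light. $\phi$ reminds us of course of the role of the electrostatic
Coulomb potential; compare definition (2.5.4) and the equations of (2.5.7) for
example. The vector potential $A$ allows for generalization of Oersted and Faraday
magnetic induction laws. Namely, use (8.1.1) to directly compute

$$\operatorname{curl} E \overset{\text{def}}{=} \left( \frac{\partial E_3}{\partial y} - \frac{\partial E_2}{\partial z},\ \frac{\partial E_1}{\partial z} - \frac{\partial E_3}{\partial x},\ \frac{\partial E_2}{\partial x} - \frac{\partial E_1}{\partial y} \right). \tag{8.1.2}$$

Assuming that $\phi$, $A$ are of class $C^2$, so that one has equality of mixed partials, the
result is independent of $\phi$:

$$\operatorname{curl} E = -\frac{1}{c}\left( \frac{\partial}{\partial t}\Big(\frac{\partial A_3}{\partial y} - \frac{\partial A_2}{\partial z}\Big),\ \frac{\partial}{\partial t}\Big(\frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x}\Big),\ \frac{\partial}{\partial t}\Big(\frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y}\Big) \right) = -\frac{1}{c}\frac{\partial B}{\partial t}. \tag{8.1.3}$$

It is customary to express (8.1.1) as

$$E = -\nabla\phi - \frac{1}{c}\frac{\partial A}{\partial t}, \quad B = \operatorname{curl} A. \tag{8.1.4}$$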



The computation leading to (8.1.3) just mentioned simply amounts to the fact, of
course, that the curl of a gradient vanishes. Similarly, because of the equality of
mixed partial derivatives, the divergence of a curl vanishes: for $A$ of class $C^2$

$$\operatorname{div} B \overset{\text{def}}{=} \frac{\partial B_1}{\partial x} + \frac{\partial B_2}{\partial y} + \frac{\partial B_3}{\partial z} = 0 \tag{8.1.5}$$

by direct computation, by (8.1.1). Equation (8.1.3) (Faraday's law) and
equation (8.1.5) (the no magnetic charges law) are part of Maxwell's equations which
govern the propagation of an electromagnetic field. There are two other Maxwell
equations (Gauss' law and Ampère's law) which we shall not venture to discuss.



Example 

For $B \in \mathbb{R}$ fixed define a map $B : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ (a constant magnetic field
along the $z$-axis) by

$$B(x, y, z, t) = (0, 0, B). \tag{8.1.6}$$

It is trivial to construct a vector potential $A : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ for $B$: for

$$A(x, y, z, t) \overset{\text{def}}{=} \left( -\frac{By}{2},\ \frac{Bx}{2},\ 0 \right) \tag{8.1.7}$$

one has $B = \operatorname{curl} A$. Note that another choice for $A$, which will also be useful later, is

$$A(x, y, z, t) = (-By, 0, 0). \tag{8.1.8}$$

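Let $m$ denote the mass of our charged particle, whose equation of motion $\Phi =
(\Phi_1, \Phi_2, \Phi_3) : \mathbb{R} \to \mathbb{R}^3$ we turn to. For $v = \Phi' : \mathbb{R} \to \mathbb{R}^3$ the velocity function,
Newton's second law is $F = m\,dv/dt$, where in the present case $F$ is given by the
Lorentz formula

$$F = qE + \frac{q}{c}\, v \times B. \tag{8.1.9}$$

$v \times B$ is the cross product:

$$(v \times B)(x, y, z, t) = \begin{vmatrix}
i & j & k \\
\Phi_1'(t) & \Phi_2'(t) & \Phi_3'(t) \\
B_1(x, y, z, t) & B_2(x, y, z, t) & B_3(x, y, z, t)
\end{vmatrix}. \tag{8.1.10}$$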


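One expands the determinant in (8.1.10) to calculate explicitly the components
$F_1$, $F_2$, $F_3$ of $F$. For example, by (8.1.9), (8.1.10),

$$F_1(x, y, z, t) = qE_1(x, y, z, t) + \frac{q}{c}\left[ \Phi_2'(t)B_3(x, y, z, t) - \Phi_3'(t)B_2(x, y, z, t) \right]; \tag{8.1.11}$$

similar equations hold for $F_2$, $F_3$. In particular by Newton's law

$$q\left\{ E_1(\Phi(t), t) + \frac{1}{c}\left( \Phi_2'(t)B_3(\Phi(t), t) - \Phi_3'(t)B_2(\Phi(t), t) \right) \right\} = m\Phi_1''(t). \tag{8.1.12}$$

On the other hand, by (8.1.1),

$$E_1 = -\frac{\partial \phi}{\partial x} - \frac{1}{c}\frac{\partial A_1}{\partial t}, \quad B_2 = \frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x}, \quad B_3 = \frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y},$$

which substituted in (8.1.12) yields, with all partial derivatives evaluated at $(\Phi(t), t)$,

$$m\Phi_1''(t) = q\left[ -\frac{\partial \phi}{\partial x} - \frac{1}{c}\frac{\partial A_1}{\partial t} \right] + \frac{q}{c}\left[ \Phi_2'(t)\Big( \frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y} \Big) - \Phi_3'(t)\Big( \frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x} \Big) \right]. \tag{8.1.13}$$

Similarly

$$\begin{aligned}
m\Phi_2''(t) &= q\left[ -\frac{\partial \phi}{\partial y} - \frac{1}{c}\frac{\partial A_2}{\partial t} \right] + \frac{q}{c}\left[ \Phi_3'(t)\Big( \frac{\partial A_3}{\partial y} - \frac{\partial A_2}{\partial z} \Big) - \Phi_1'(t)\Big( \frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y} \Big) \right], \\
m\Phi_3''(t) &= q\left[ -\frac{\partial \phi}{\partial z} - \frac{1}{c}\frac{\partial A_3}{\partial t} \right] + \frac{q}{c}\left[ \Phi_1'(t)\Big( \frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x} \Big) - \Phi_2'(t)\Big( \frac{\partial A_3}{\partial y} - \frac{\partial A_2}{\partial z} \Big) \right].
\end{aligned} \tag{8.1.14}$$

Equations (8.1.13), (8.1.14) are thus the equations of motion of the charged particle
under the influence of the field $(E, B)$, expressed in terms of the scalar and
vector potential $(\phi, A)$.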



Example 

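Choose the scalar potential $\phi = 0$ and choose the vector potential $A$ to be that
given by (8.1.8), where we take $B \in \mathbb{R}$ to be non-zero. By (8.1.1), $E = 0$ and
by (8.1.8),

$$\frac{\partial A_1}{\partial x} = 0, \quad \frac{\partial A_1}{\partial y} = -B, \quad \frac{\partial A_1}{\partial z} = 0,$$

with all partial derivatives of $A_2$, $A_3$ equal to zero. By (8.1.13), (8.1.14) we obtain
therefore the following equations of motion.

$$m\Phi_1''(t) = \frac{qB}{c}\Phi_2'(t), \quad m\Phi_2''(t) = -\frac{qB}{c}\Phi_1'(t), \quad m\Phi_3''(t) = 0. \tag{8.1.15}$$

Then

$$\Phi_3(t) = \alpha_3 t + \gamma_3 \tag{8.1.16}$$

for constants $\alpha_3, \gamma_3 \in \mathbb{R}$. Define $\omega \overset{\text{def}}{=} qB/mc$ and deduce from (8.1.15) that

$$\frac{d^2}{dt^2}\left[ \Phi_1'(t) \right] = \Phi_1'''(t) = \omega\Phi_2''(t) = -\omega^2\Phi_1'(t) \tag{8.1.17}$$

and similarly that

$$\frac{d^2}{dt^2}\left[ \Phi_2'(t) \right] = \Phi_2'''(t) = -\omega\Phi_1''(t) = -\omega^2\Phi_2'(t), \tag{8.1.18}$$

which are equations of simple harmonic motion for $\Phi_j'(t)$, $j = 1, 2$, with frequency
$\omega/(2\pi)$, and with general solutions

$$\Phi_j'(t) = \alpha_j \cos(\omega t - \beta_j), \quad j = 1, 2, \tag{8.1.19}$$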

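for constants $\alpha_j, \beta_j \in \mathbb{R}$; compare equation (2.2.17). Now (8.1.15) and (8.1.19)
together imply that

$$m\omega\alpha_2 \cos(\omega t - \beta_2) \overset{\text{def. of } \omega}{=} \frac{qB}{c}\alpha_2 \cos(\omega t - \beta_2) = \frac{qB}{c}\Phi_2'(t) = m\Phi_1''(t) = -m\omega\alpha_1 \sin(\omega t - \beta_1), \tag{8.1.20}$$

and since $B \neq 0$ $\Longrightarrow$ $\omega \neq 0$ we conclude that for all $t$

$$-\alpha_1 \sin(\omega t - \beta_1) = \alpha_2 \cos(\omega t - \beta_2), \tag{8.1.21}$$

which we differentiate to get (again as $B \neq 0$)

$$\alpha_1 \cos(\omega t - \beta_1) = \alpha_2 \sin(\omega t - \beta_2). \tag{8.1.22}$$

Square both sides of (8.1.21), (8.1.22) and add: $\alpha_1^2 = \alpha_2^2$ $\Longrightarrow$ $\alpha_1 = \pm\alpha_2$.
From (8.1.19)

$$\Phi_j(t) = \frac{\alpha_j}{\omega} \sin(\omega t - \beta_j) + \gamma_j, \quad j = 1, 2 \tag{8.1.23}$$

for some constants $\gamma_j \in \mathbb{R}$. That is, by (8.1.22),

$$\begin{aligned}
\Phi_1(t) &= \frac{\alpha_1}{\omega} \sin(\omega t - \beta_1) + \gamma_1, \\
\Phi_2(t) &= \frac{\alpha_2}{\omega} \sin(\omega t - \beta_2) + \gamma_2 = \frac{\alpha_1}{\omega} \cos(\omega t - \beta_1) + \gamma_2,
\end{aligned} \tag{8.1.24}$$

so that by (8.1.16) we obtain the equation of motion (for $B \neq 0$ in (8.1.8))

$$\Phi(t) = \left( \frac{\alpha_1}{\omega} \sin(\omega t - \beta_1),\ \frac{\alpha_1}{\omega} \cos(\omega t - \beta_1),\ \alpha_3 t \right) + \gamma \tag{8.1.25}$$

for constants $\alpha_1, \alpha_3, \beta_1 \in \mathbb{R}$, $\gamma = (\gamma_1, \gamma_2, \gamma_3) \in \mathbb{R}^3$. We see that the path of
the particle is a helix confined to the circular cylinder $(x - \gamma_1)^2 + (y - \gamma_2)^2 = \alpha_1^2/\omega^2$ of radius
$|\alpha_1/\omega|$. The cylinder is parallel to the direction of the magnetic field (the $z$-axis).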
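The helical motion can also be observed by integrating the equations of motion (8.1.15) numerically. The following Python sketch is an illustration under stated assumptions: the numerical values of $q$, $B$, $m$, $c$ and of the initial data are arbitrary (hypothetical), a classical fourth-order Runge-Kutta step is used as the integrator, and the axis of the cylinder is computed from the initial data as $\gamma_1 = x_0 + v_{2,0}/\omega$, $\gamma_2 = y_0 - v_{1,0}/\omega$, which follows from (8.1.15) by one integration. The computed orbit should stay at distance $|\alpha_1/\omega|$ from that axis, where $|\alpha_1|$ is the speed in the $(x, y)$-plane.

```python
import math

# Illustrative (hypothetical) values in arbitrary units.
q, B, m, c = 1.0, 2.0, 1.0, 1.0
omega = q * B / (m * c)


def rhs(state):
    """Right-hand side of (8.1.15) written as a first-order system."""
    x, y, z, vx, vy, vz = state
    return (vx, vy, vz, omega * vy, -omega * vx, 0.0)


def rk4_step(state, dt):
    """One classical Runge-Kutta step of size dt."""
    def shifted(s, k, factor):
        return tuple(si + factor * ki for si, ki in zip(s, k))

    k1 = rhs(state)
    k2 = rhs(shifted(state, k1, dt / 2.0))
    k3 = rhs(shifted(state, k2, dt / 2.0))
    k4 = rhs(shifted(state, k3, dt))
    return tuple(s + dt / 6.0 * (a + 2.0 * b + 2.0 * cc + d)
                 for s, a, b, cc, d in zip(state, k1, k2, k3, k4))


x0, y0, z0 = 1.0, 0.5, 0.0
vx0, vy0, vz0 = 0.6, -0.8, 0.25
state = (x0, y0, z0, vx0, vy0, vz0)

# Axis of the cylinder and its radius |alpha_1 / omega|.
gamma1 = x0 + vy0 / omega
gamma2 = y0 - vx0 / omega
radius = math.hypot(vx0, vy0) / abs(omega)

dt, steps = 0.002, 5000
max_deviation = 0.0
for _ in range(steps):
    state = rk4_step(state, dt)
    distance = math.hypot(state[0] - gamma1, state[1] - gamma2)
    max_deviation = max(max_deviation, abs(distance - radius))

total_time = dt * steps
print(radius, max_deviation, state[2])
```

The motion along the $z$-axis is uniform, in agreement with (8.1.16), so `state[2]` equals `z0 + vz0 * total_time` up to rounding.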

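One can set up a suitable Lagrangian $L$ for the motion of a charged particle in
an electromagnetic field $(E, B)$. In Section 2.7 of Chapter 2 the Lagrangian for the
motion of $n$ particles in $\mathbb{R}^3$ was a map from $\mathbb{R}^{3n} \times \mathbb{R}^{3n} \to \mathbb{R}$. In the present case
$n = 1$. However as $\phi$, $A$ are time-dependent we take $L$ to be a time-dependent
map from $\mathbb{R}^3 \times \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$. Namely

$$L(x, y, z, v_1, v_2, v_3, t) \overset{\text{def}}{=} \frac{m}{2}\sum_{j=1}^3 v_j^2 - q\phi(x, y, z, t) + \frac{q}{c}\sum_{j=1}^3 v_jA_j(x, y, z, t). \tag{8.1.26}$$

Let $q = (x, y, z)$ also denote the position variable (not to be confused with the charge $q$),
and let $v = (v_1, v_2, v_3)$. Clearly

$$\frac{\partial L}{\partial v_k}(q, v, t) = mv_k + \frac{q}{c}A_k(q, t), \ \text{ and} \tag{8.1.27}$$

$$\frac{\partial L}{\partial x}(q, v, t) = -q\frac{\partial \phi}{\partial x}(q, t) + \frac{q}{c}\sum_{j=1}^3 v_j\frac{\partial A_j}{\partial x}(q, t), \tag{8.1.28}$$

with similar formulas for $\partial L/\partial y$, $\partial L/\partial z$. Then, by the chain rule, with all partial
derivatives of $\phi$, $A$ evaluated at $(\Phi(t), t)$,

$$\begin{aligned}
\frac{d}{dt}&\frac{\partial L}{\partial v_1}(\Phi(t), \Phi'(t), t) - \frac{\partial L}{\partial x}(\Phi(t), \Phi'(t), t) \\
&= m\Phi_1''(t) + \frac{q}{c}\left[ \frac{\partial A_1}{\partial x}\Phi_1'(t) + \frac{\partial A_1}{\partial y}\Phi_2'(t) + \frac{\partial A_1}{\partial z}\Phi_3'(t) + \frac{\partial A_1}{\partial t} \right] + q\frac{\partial \phi}{\partial x} - \frac{q}{c}\sum_{j=1}^3 \Phi_j'(t)\frac{\partial A_j}{\partial x} \\
&= m\Phi_1''(t) + q\left[ \frac{\partial \phi}{\partial x} + \frac{1}{c}\frac{\partial A_1}{\partial t} \right] + \frac{q}{c}\left[ \Big( \frac{\partial A_1}{\partial y} - \frac{\partial A_2}{\partial x} \Big)\Phi_2'(t) + \Big( \frac{\partial A_1}{\partial z} - \frac{\partial A_3}{\partial x} \Big)\Phi_3'(t) \right].
\end{aligned} \tag{8.1.29}$$

Thus we see that

$$\frac{d}{dt}\frac{\partial L}{\partial v_1}(\Phi(t), \Phi'(t), t) - \frac{\partial L}{\partial x}(\Phi(t), \Phi'(t), t) = 0 \tag{8.1.30}$$

$\Longleftrightarrow$ equation (8.1.13) holds. Similarly

$$\frac{d}{dt}\frac{\partial L}{\partial v_2}(\Phi(t), \Phi'(t), t) - \frac{\partial L}{\partial y}(\Phi(t), \Phi'(t), t) = 0 \tag{8.1.31}$$

and

$$\frac{d}{dt}\frac{\partial L}{\partial v_3}(\Phi(t), \Phi'(t), t) - \frac{\partial L}{\partial z}(\Phi(t), \Phi'(t), t) = 0 \tag{8.1.32}$$

$\Longleftrightarrow$ equations (8.1.14) hold, which proves

Theorem 8.1. (Lagrangian form of the equation of motion of a charged particle
in an electromagnetic field $(E, B)$ with scalar and vector potential $(\phi, A)$).
Let $L : \mathbb{R}^3 \times \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$ be the Lagrangian constructed in (8.1.26). Then the
equations of motion (8.1.13), (8.1.14) are equivalent to the equations (8.1.30),
(8.1.31), (8.1.32).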

One may compare Theorem 8.1 with Theorem 2.3. 
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8.2 Classical and Quantized Hamiltonian 
for a Charged Particle 

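The Lorentz force $F$ in (8.1.9) is not the gradient of some potential energy function
$V$. Thus one cannot set up a Hamiltonian function $H$ as in definition (2.6.1).
However, as pointed out in that chapter, one can use equation (2.7.12) there as a
means of defining $H$, given $L$ in the absence of $V$. Thus given (2.7.12), we are
led to define $H : \mathbb{R}^3 \times \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$ by

$$H(q, p, t) \overset{\text{def}}{=} \frac{\partial L}{\partial p_1}(q, p, t)\,p_1 + \frac{\partial L}{\partial p_2}(q, p, t)\,p_2 + \frac{\partial L}{\partial p_3}(q, p, t)\,p_3 - L(q, p, t) \tag{8.2.1}$$

for $q = (x, y, z)$, $p = (p_1, p_2, p_3) \in \mathbb{R}^3$. Using (8.1.26) and (8.1.27) one has

$$\begin{aligned}
H(q, p, t) &= \sum_{k=1}^3 \Big( mp_k + \frac{q}{c}A_k(q, t) \Big)p_k - \frac{m}{2}\sum_{k=1}^3 p_k^2 + q\phi(q, t) - \frac{q}{c}\sum_{k=1}^3 p_kA_k(q, t) \\
&= \frac{1}{2m}\sum_{k=1}^3 (mp_k)^2 + q\phi(q, t) \\
&= \frac{1}{2m}\sum_{k=1}^3 \Big[ \frac{\partial L}{\partial p_k}(q, p, t) - \frac{q}{c}A_k(q, t) \Big]^2 + q\phi(q, t).
\end{aligned} \tag{8.2.2}$$

On the other hand, in accord with remarks of Chapter 2, taking (2.7.13) there as
a cue, we define generalized momenta $p_x$, $p_y$, $p_z$:

$$p_x = \frac{\partial L}{\partial p_1}, \quad p_y = \frac{\partial L}{\partial p_2}, \quad p_z = \frac{\partial L}{\partial p_3}. \tag{8.2.3}$$

Equation (8.2.2) then takes the form

$$H = \frac{1}{2m}\left[ \Big( p_x - \frac{q}{c}A_1 \Big)^2 + \Big( p_y - \frac{q}{c}A_2 \Big)^2 + \Big( p_z - \frac{q}{c}A_3 \Big)^2 \right] + q\phi. \tag{8.2.4}$$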


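Apply the usual quantization rules

$$p_x \to \frac{\hbar}{i}\frac{\partial}{\partial x}, \quad p_y \to \frac{\hbar}{i}\frac{\partial}{\partial y}, \quad p_z \to \frac{\hbar}{i}\frac{\partial}{\partial z} \tag{8.2.5}$$

to quantize $H$: If $M_{A_1}$ denotes multiplication by $A_1$ we see that

$$\Big[ p_x - \frac{q}{c}A_1 \Big]^2 \to \Big[ \frac{\hbar}{i}\frac{\partial}{\partial x} - \frac{q}{c}M_{A_1} \Big]^2 = -\hbar^2\frac{\partial^2}{\partial x^2} - \frac{\hbar}{i}\frac{q}{c}\,\frac{\partial}{\partial x} \circ M_{A_1} - \frac{\hbar}{i}\frac{q}{c}\,M_{A_1}\frac{\partial}{\partial x} + \frac{q^2}{c^2}A_1^2, \tag{8.2.6}$$

where (by the product rule)

$$\frac{\partial}{\partial x} \circ M_{A_1} = M_{\partial A_1/\partial x} + A_1\frac{\partial}{\partial x} \ \Longrightarrow\ \Big[ p_x - \frac{q}{c}A_1 \Big]^2 \to -\hbar^2\frac{\partial^2}{\partial x^2} - \frac{2q}{c}\frac{\hbar}{i}A_1\frac{\partial}{\partial x} - \frac{q}{c}\frac{\hbar}{i}\frac{\partial A_1}{\partial x} + \frac{q^2}{c^2}A_1^2. \tag{8.2.7}$$

By similar quantization rules for $[p_y - \frac{q}{c}A_2]^2$, $[p_z - \frac{q}{c}A_3]^2$ we arrive at the
quantized Hamiltonian $H$ given by

$$H \overset{\text{def}}{=} \frac{1}{2m}\left[ -\hbar^2\nabla^2 - \frac{2q\hbar}{ci}\Big( A_1\frac{\partial}{\partial x} + A_2\frac{\partial}{\partial y} + A_3\frac{\partial}{\partial z} \Big) - \frac{q\hbar}{ci}\Big( \frac{\partial A_1}{\partial x} + \frac{\partial A_2}{\partial y} + \frac{\partial A_3}{\partial z} \Big) + \frac{q^2}{c^2}\big( A_1^2 + A_2^2 + A_3^2 \big) \right] + q\phi \tag{8.2.8}$$

for $\nabla^2 = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} + \dfrac{\partial^2}{\partial z^2}$; see (8.2.4).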



^2 
dy^ 



»2 



Example 

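Let $B$ be the constant magnetic field given by (8.1.6). For the choice of vector
potential $A$ given by (8.1.7),

$$\frac{\partial A_1}{\partial x} + \frac{\partial A_2}{\partial y} + \frac{\partial A_3}{\partial z} = 0, \quad A_1^2 + A_2^2 + A_3^2 = \frac{B^2}{4}(x^2 + y^2),$$

$$\frac{\hbar}{i}\Big( A_1\frac{\partial}{\partial x} + A_2\frac{\partial}{\partial y} + A_3\frac{\partial}{\partial z} \Big) = \frac{\hbar}{i}\Big( -\frac{By}{2}\frac{\partial}{\partial x} + \frac{Bx}{2}\frac{\partial}{\partial y} \Big) = \frac{B}{2}L_z,$$

where $L_z = \dfrac{\hbar}{i}\Big( x\dfrac{\partial}{\partial y} - y\dfrac{\partial}{\partial x} \Big)$ is the quantized angular momentum operator in the
$z$-direction; see (5.4.4). $H$ in (8.2.8) therefore reduces to

$$H = -\frac{\hbar^2}{2m}\nabla^2 - \frac{qB}{2mc}L_z + \frac{q^2B^2}{8mc^2}(x^2 + y^2) + q\phi. \tag{8.2.9}$$

For the same $B$ of (8.1.6), another choice of vector potential $A$ is given by (8.1.8).
For this choice

$$\frac{\partial A_1}{\partial x} + \frac{\partial A_2}{\partial y} + \frac{\partial A_3}{\partial z} = 0$$

again, but now $A_1^2 + A_2^2 + A_3^2 = B^2y^2$. As $A_2, A_3 = 0$, $H$ in (8.2.8) reduces to

$$H = -\frac{\hbar^2}{2m}\nabla^2 + \frac{q\hbar}{mci}\,By\frac{\partial}{\partial x} + \frac{q^2B^2y^2}{2mc^2} + q\phi. \tag{8.2.10}$$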

Patrick Shanahan's Chapter 18 will include material from this chapter with
further discussion, especially in the direction of gauge theory. As pointed out
in Chapter 0 (in remarks following equation (0.0.16)) the quantity $\omega = qB/mc$ in
(8.1.19), for $B$ representing a magnetic field strength, indeed does have units
$\text{sec}^{-1}$; i.e., indeed $\omega/2\pi$ is a frequency. Compare also definition (9.3.23).
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Appendix 8A The Electric Field Due to a Collection 
of Point Charges, Electric Potential, and Voltage 



The notion of a magnetic dipole moment will arise in connection with Pauli's
equation in the next chapter. It is appropriate, prior to then, to consider the corre- 
sponding notion of an electric dipole moment. This we take up in Appendix 8B, 
which we prepare for now. We find another application of Coulomb’s law as we 
consider the electric field due to a collection of point charges. Some elementary 
material on electric potential and voltage is also presented. 

Fix a charge $q$ at some point $A = (x_1, y_1, z_1) \in \mathbb{R}^3$. This charge defines a vector field $E$ on the domain of points $B = (x_2, y_2, z_2) \in \mathbb{R}^3 - \{A\}$ as follows. The vector $E_B$ at $B$ is defined as the force that a unit positive charge at $B$ experiences due to the charge $q$ at $A$. $E_B$ can be found by an application of Coulomb's law:

\[ E_B = \frac{q\{(x_2, y_2, z_2) - (x_1, y_1, z_1)\}}{[(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2]^{3/2}}; \tag{8.A.1} \]

thus $E_B = F$ in equation (2.5.1), with $q_1$ there equal to $q$, and $q_2 = 1$.

The direction of $E_B$ is as follows. If $q > 0$, then the charge $q$ and the unit test charge have the same sign, so $E_B$ points away from $A$; the charges are opposite if $q < 0$, and $E_B$ then points toward $A$; see Figure 8.1. Note that the magnitude $|E_B|$ of $E_B$ is given by

\[ |E_B| = \frac{|q|}{(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2} = \frac{|q|}{[\text{distance from } A \text{ to } B]^2}. \tag{8.A.2} \]

Figure 8.1: Direction of $E_B$

An arbitrary charge $q'$ at $B$ experiences a Coulomb force

\[ F = \frac{qq'\{(x_2, y_2, z_2) - (x_1, y_1, z_1)\}}{[(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2]^{3/2}} = q'E_B, \tag{8.A.3} \]

by (8.A.1). The vector field $E : B \to E_B$ is called the electric field due to the charge $q$. For $q' = 1$ esu in (8.A.3) one observes that $E_B$ is measured in the units dynes/esu, in the c.g.s. system.
Example

Find the electric field vector $E_B$ 10 cm away from a charge $q = -1500$ esu. What force does a charge $q' = 30$ esu experience due to the charge $q$?

Figure 8.2: Example (the charge $q = -1500$ esu sits at the origin $0$, the point $B$ lies on the $x$-axis at $x = 10$ cm, and $E_B$ is drawn at $B$)

We take $k = 1$ in equation (0.0.4). By (8.A.2),

\[ |E_B| = \frac{|-1500|}{10^2} = 15 \text{ dynes/esu}. \]

By Figure 8.1, $E_B$ is directed toward the origin, as illustrated in Figure 8.2. By (8.A.3) the charge $q'$ at $B$ experiences a force of 30 esu $\times$ 15 dynes/esu = 450 dynes due to the charge $q$ at the origin.
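
The arithmetic of this example can be reproduced with a few lines of Python. This is only a sketch of (8.A.2) and (8.A.3) in c.g.s. units; the function name `field_magnitude` and the keyword `k` (the constant of equation (0.0.4), taken to be 1 here) are illustrative choices, not notation from the text.

```python
def field_magnitude(q_esu, distance_cm, k=1.0):
    """|E_B| = k|q| / distance^2 in dynes/esu, following (8.A.2)."""
    return k * abs(q_esu) / distance_cm ** 2


q = -1500.0       # source charge at the origin, in esu
q_prime = 30.0    # second charge placed at B, in esu
r = 10.0          # distance from the origin to B, in cm

E = field_magnitude(q, r)          # field strength at B
force = abs(q_prime) * E           # magnitude of q' E_B, following (8.A.3)
print(E, force)
```

Since $q < 0$ and $q' > 0$, the force of this magnitude is attractive, in agreement with the direction shown in Figure 8.2.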

So far we have considered the electric vector field generated by a single charge $q$ at the point $A$. Suppose now there is a finite collection of charges $q_1, q_2, \ldots, q_n$ at distinct points $A_1, A_2, \ldots, A_n$. Experiments have well established the following superposition principle: If $B$ is a point distinct from the $A_j$, and $q'$ is a charge at $B$, then the Coulomb force exerted on the charge $q'$ due to the charges $q_j$ is the sum of the forces exerted individually on $q'$ by the $q_j$. We can construct an electric vector field $E : B \to E_B$ away from the $A_j$ in a manner similar to the case of a single charge. Namely, for a point $B$ distinct from the $A_j$ the vector $E_B$ at $B$ is defined as the force that a unit positive charge at $B$ experiences due to the charges $q_j$ at $A_j$. If $E_j$ is the electric field due to the single charge $q_j$ and $E_{jB}$ is the corresponding electric field vector at $B$, then by the superposition principle

\[ E_B = E_{1B} + E_{2B} + \cdots + E_{nB}. \tag{8.A.4} \]

By (8.A.3) an arbitrary charge $q'$ at $B$ experiences a force $q'E_{jB}$ due to the charge $q_j$ and thus (by the superposition principle) a total force

\[ F = q'E_{1B} + q'E_{2B} + \cdots + q'E_{nB} = q'E_B, \tag{8.A.5} \]

by (8.A.4), due to the field of charges.

Example 

Charges $q_1 = .35$ esu and $q_2 = -(.55)$ esu are situated at the points $A_1 = (0, 4, 0)$, $A_2 = (3, 0, 0)$ respectively. Taking the dielectric constant $1/k = 1$ in equation (0.0.4) we wish to compute the electric field vector at the point $B = (0, 0, 5)$.

Figure 8.3: Multiple charges

Let $E_1$, $E_2$ denote the electric fields due to the charges $q_1$, $q_2$. By (8.A.1) the unit positive charge at $B$ experiences a force

\[ E_{1B} = \frac{q_1(0, -4, 5)}{[16 + 25]^{3/2}} = \frac{.35(0, -4, 5)}{262.52} = (.001333)(0, -4, 5) = (0, -.005333, .006665) \tag{8.A.6} \]

due to $q_1$, and similarly a force

\[ E_{2B} = \frac{q_2(-3, 0, 5)}{[9 + 25]^{3/2}} = \frac{-(.55)(-3, 0, 5)}{198.25} = -(.00277)(-3, 0, 5) = (.00832, 0, -.01385). \tag{8.A.7} \]

By (8.A.4) the desired electric vector $E_B$ is given by

\[ E_B = E_{1B} + E_{2B} = (.00832, -.00533, -.00718) \text{ dynes/esu}. \tag{8.A.8} \]
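
The superposition computation in (8.A.6) through (8.A.8) can be sketched in Python as follows. The helper names `point_charge_field` and `total_field` are hypothetical; each charge contributes a vector by (8.A.1) (with $k = 1$) and the contributions are added as in (8.A.4).

```python
import math


def point_charge_field(q, source, point):
    """Field vector at `point` due to charge q at `source`, following (8.A.1)."""
    d = [p - s for p, s in zip(point, source)]
    r_cubed = math.sqrt(sum(c * c for c in d)) ** 3
    return [q * c / r_cubed for c in d]


def total_field(charges, point):
    """Sum of the single-charge fields, following the superposition rule (8.A.4)."""
    total = [0.0, 0.0, 0.0]
    for q, source in charges:
        e = point_charge_field(q, source, point)
        total = [t + c for t, c in zip(total, e)]
    return total


charges = [(0.35, (0.0, 4.0, 0.0)), (-0.55, (3.0, 0.0, 0.0))]
E_B = total_field(charges, (0.0, 0.0, 5.0))
print([round(c, 5) for c in E_B])
```

Carried out in full floating-point precision, the third component comes out near $-.0072$; the value $-.00718$ in (8.A.8) reflects the rounded intermediate factors $.001333$ and $.00277$.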

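By (8.A.1) the force $E(x, y, z)$ on a unit charge at $(x, y, z)$ due to the charge $q$ at $(x_1, y_1, z_1)$ is given by

\[ E(x, y, z) = (E_1(x, y, z), E_2(x, y, z), E_3(x, y, z)) \]

for

\[ \begin{aligned}
E_1(x, y, z) &= \frac{q(x - x_1)}{[(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2]^{3/2}}, \\
E_2(x, y, z) &= \frac{q(y - y_1)}{[(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2]^{3/2}}, \\
E_3(x, y, z) &= \frac{q(z - z_1)}{[(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2]^{3/2}},
\end{aligned} \tag{8.A.9} \]

where $(x, y, z) \in \mathbb{R}^3 - \{(x_1, y_1, z_1)\}$. It follows that we can immediately construct an electric potential energy function $V : \mathbb{R}^3 - \{(x_1, y_1, z_1)\} \to \mathbb{R}$ for the vector field $E$. Namely,

\[ V(x, y, z) = \frac{q}{[(x - x_1)^2 + (y - y_1)^2 + (z - z_1)^2]^{1/2}}. \tag{8.A.10} \]

That is,

\[ -\frac{\partial V}{\partial x} = E_1, \quad -\frac{\partial V}{\partial y} = E_2, \quad -\frac{\partial V}{\partial z} = E_3: \qquad E = -\nabla V. \tag{8.A.11} \]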



In particular the work $W$ done by the field $E$ in moving the unit test charge from point $B_1$ to point $B_2$ is path-independent as it is given by

\[ W = \int_C E \cdot dr = -\int_C (\nabla V) \cdot dr = -[V(B_2) - V(B_1)], \tag{8.A.12} \]

where $C$ is any curve from $B_1$ to $B_2$. By (8.A.3) the force on an arbitrary charge $q'$ at $(x, y, z) \in \mathbb{R}^3 - \{(x_1, y_1, z_1)\}$ is $q'E(x, y, z) = -q'(\nabla V)(x, y, z) = -(\nabla q'V)(x, y, z)$, which means that the work $W_{q'}$ done by the field $E$ in moving $q'$ from $B_1$ to $B_2$ is given by

\[ W_{q'} = -\int_C (\nabla q'V) \cdot dr = -q'[V(B_2) - V(B_1)] = q'W, \tag{8.A.13} \]

with $C$ as in (8.A.12).

A potential difference, $V(B_2) - V(B_1)$ as in (8.A.12), is called a voltage difference, or more simply a voltage, in honor of Alessandro Volta (1745-1827) who invented the electric battery which thus led to initial studies of electric currents. For $q' = 1$ esu in (8.A.13) one observes that voltage is measured in the unit (dyne $\times$ cm)/esu = erg/esu, in the c.g.s. system, which is also called a stat-volt. The more familiar unit, the volt, is given by 1 volt = 1/300 stat-volt. More precisely 1 volt $= \frac{1}{299.79}$ stat-volt.

Given a collection of charges $q_1, q_2, \ldots, q_n$ at distinct points $A_1, A_2, \ldots, A_n$, as before, and the electric fields $E_1, \ldots, E_n$ they generate (away from the points $A_1, A_2, \ldots, A_n$), the force $E(x, y, z)$ on a unit charge at $(x, y, z) \neq$ the $A_j$ due to the $q_j$ is given by (8.A.4):

\[ E(x, y, z) = \sum_{j=1}^n E_j(x, y, z). \tag{8.A.14} \]

If $V_j$ are electric potential energy functions for the $E_j$ (say $V_j$ is given by (8.A.10) with $(x_1, y_1, z_1)$ there replaced by $A_j$), it follows that

\[ E = -\nabla V \quad \text{for} \quad V = \sum_{j=1}^n V_j, \tag{8.A.15} \]

and thus if $W$, $W_j$ is the work done by the fields $E$, $E_j$, respectively, in moving the unit test charge from $B_1$ to $B_2$, one has (as in (8.A.12))

\[ W = -[V(B_2) - V(B_1)] = -\sum_{j=1}^n [V_j(B_2) - V_j(B_1)] = \sum_{j=1}^n W_j, \tag{8.A.16} \]

with

\[ q'W = \text{the work done by } E \text{ in moving an arbitrary charge } q' \text{ from } B_1 \text{ to } B_2, \tag{8.A.17} \]

in comparison with (8.A.13). If the unit test charge is brought to the point $B_2$ from a distance infinitely far away ("$B_1 = \infty$," or each $V_j(B_1) = 0$ by comparison with (8.A.10)) then one has in (8.A.16)

\[ W = -V(B_2), \tag{8.A.18} \]

which one can express as follows:

the absolute potential $V(B)$ at a point $B$ equals $-W$ = the work done against the field $E$ in bringing a unit positive test particle from infinitely far away to the point $B$.

Example

In a container of oil with a dielectric constant $1/k = 5.3$, two charges $q_1$, $q_2$ are separated 10 cm apart; see Figure 8.4. If $q_1 = 100$ esu and $q_2 = 50$ esu what is the absolute potential $V(B)$ at a point $B$ midway between $q_1$ and $q_2$?

Figure 8.4: Two charges in oil

Solution: By (8.A.10), with the factor $k = 1/5.3$ included and with $B$ at distance 5 cm from each charge,

\[ V_1(B) = \frac{100}{(5.3)(5)} = 3.77 \text{ stat-volts}, \qquad V_2(B) = \frac{50}{(5.3)(5)} = 1.89 \text{ stat-volts}. \tag{8.A.19} \]

By (8.A.15), $V(B) = V_1(B) + V_2(B) = 5.66$ stat-volts.
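
As a quick numerical check of this example, the following Python sketch adds the single-charge potentials as in (8.A.15). The function name `absolute_potential` and its `dielectric_constant` parameter are illustrative assumptions; the factor $k = 1/5.3$ multiplies each term $q_j/r_j$.

```python
def absolute_potential(charges_and_distances, dielectric_constant=1.0):
    """Sum of k*q/r over the charges (stat-volts), with k = 1/dielectric_constant."""
    k = 1.0 / dielectric_constant
    return sum(k * q / r for q, r in charges_and_distances)


# q1 = 100 esu and q2 = 50 esu, each 5 cm from the midpoint B
V1 = absolute_potential([(100.0, 5.0)], dielectric_constant=5.3)
V2 = absolute_potential([(50.0, 5.0)], dielectric_constant=5.3)
V = absolute_potential([(100.0, 5.0), (50.0, 5.0)], dielectric_constant=5.3)
print(round(V1, 2), round(V2, 2), round(V, 2))
```

The two contributions round to 3.77 and 1.89 stat-volts, and their sum to 5.66 stat-volts.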

Another unit of energy is the electron volt (eV), which we have already encountered. It is defined as the amount of work $W_{-e}$ required to move an electron through a potential difference of 1 volt. Thus in (8.A.13) and by the remarks following it, we require that $V(B_2) - V(B_1) = 1$ volt $= \frac{1}{299.79}$ stat-volts $= \frac{1}{299.79}$ erg/esu, which by (0.0.3) gives $W_{-e} = e \cdot \frac{1}{299.79}$ erg/esu $= \frac{e}{299.79}$ erg $= 1.6021 \times 10^{-12}$ erg. Or 1 erg $= \frac{1}{1.6021 \times 10^{-12}}$ eV $\approx 6.2417 \times 10^{11}$ eV, as we have noted in Section 5.8 of Chapter 5, for example. Also see the computation in (5.6.5).
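
The unit conversion can be checked numerically. In the sketch below the electron charge `E_CHARGE_ESU` is an assumed value (about $4.8032 \times 10^{-10}$ esu, standing in for the value quoted in (0.0.3), which may differ in the last digits), and the stat-volt conversion uses 1 stat-volt $\approx$ 299.792458 volts.

```python
E_CHARGE_ESU = 4.8032e-10               # assumed electron charge magnitude, in esu
STATVOLT_PER_VOLT = 1.0 / 299.792458    # 1 volt expressed in stat-volts (erg/esu)

erg_per_ev = E_CHARGE_ESU * STATVOLT_PER_VOLT   # work to move charge e through 1 volt
ev_per_erg = 1.0 / erg_per_ev

print(f"1 eV  = {erg_per_ev:.4e} erg")
print(f"1 erg = {ev_per_erg:.4e} eV")
```

Small differences in the trailing digits relative to the figures in the text come from the assumed value of $e$.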
Appendix 8B Electric Dipole Moments

An electric dipole is a system of two opposite charges q, -q separated by a dis- 
tance d, as illustrated in Figure 8.5. There the dipole moment m of the system is 
also illustrated, m being the vector with magnitude qd and with direction that of 
an arrow from the negative charge towards the positive charge. 




Figure 8.5: Dipole 



We are interested in the behavior of the dipole in the presence of an external electric field $E$; that is, $E$ is produced by other charges (independent of $q$, $-q$). We assume that $E$ is uniform: every vector $E_B$ has the same direction and magnitude. Take, for example, the direction of $E_B$ to be that of the positive $x$-axis as indicated in Figure 8.6. The field $E$ will cause the dipole to rotate to a direction parallel to $E$ as illustrated in Figure 8.6. Thus the dipole tends to align itself with the field.

By (8.A.3) the Coulomb forces on the charges $q$, $-q$, given $E$, are given by $F_{\pm q} = \pm qE$. Their tangential components are denoted by $F^t_{\pm q}$ in Figure 8.6, where $\theta + \gamma = \pi/2$. Now

\[ |F^t_q| = |F_q|\cos\gamma = |F_q|\sin\theta, \qquad |F^t_{-q}| = |F_{-q}|\cos\gamma = |F_{-q}|\sin\theta, \tag{8.B.1} \]

and the distances from $0$ to $\pm q$ are the same and equal to $d/2$. The work done by $F_q$ as $q$ undergoes a small displacement $\Delta s$ along a circular arc of radius $d/2$ (see Figure 8.7) is (with the help of (8.B.1))

\[ \Delta W^+ = |F^t_q|\,\Delta s = |F_q|(\sin\theta)\frac{d}{2}\Delta\theta. \tag{8.B.2} \]

The work done by $F_{-q}$ as $-q$ undergoes the displacement $\Delta s$ is, similarly,

\[ \Delta W^- = |F^t_{-q}|\,\Delta s = |F_q|(\sin\theta)\frac{d}{2}\Delta\theta. \tag{8.B.3} \]

We see that the total work $W$ done by the field $E$ as the dipole is rotated from some initial angle $\theta_0$ to the angle $\theta$ is given by
9.1 Spin Operators

In the introductory remarks to Chapter 5 mention was made of the 1925 proposal of Uhlenbeck and Goudsmit that to an electron there should be assigned an intrinsic angular momentum or spin $S = (S_x, S_y, S_z)$, this in addition to its quantum angular momentum (or orbital angular momentum) $M = (M_x, M_y, M_z)$ defined in (5.4.4). This is not to be taken to mean that an electron actually spins about an axis in some physical way. Experiments (in particular the noted Stern-Gerlach experiment) have provided adequate justification for the proposed concept of spin, a proposal also made by R. Bichowsky and H. Urey. Dirac has shown that the notion of spin is necessary if quantum mechanics and Einstein's relativity are to be consistent theories. Chemical properties of atoms, and their ability to combine and form molecules, and the splitting of spectral lines by a magnetic field applied to an atom (i.e., the Zeeman effect, which we discuss later), all depend on this important notion.

Proceeding in a purely formal manner, guided by analogies to orbital angular momentum, we show the mathematical existence of three spin operators $(S_x, S_y, S_z)$, and the existence of three eigenvalues $s(s+1)\hbar^2$, $m_s\hbar$ where $s = \frac{1}{2}$, $m_s = \pm\frac{1}{2}$, results which are taken to have the following meaning in conformity with known physical facts: (i) $\sqrt{s(s+1)}\,\hbar = \frac{\sqrt{3}}{2}\hbar$ is the magnitude $|S|$ of the electron's intrinsic angular momentum (or spin) $S$, where $S$ is a specific Hermitian operator which we construct with "components" $S_x, S_y, S_z$; (ii) the factor $\frac{1}{2}$ in the product $\frac{1}{2}(\frac{1}{2} + 1)\hbar^2$ is the spin quantum number $s$ of the electron: $s = \frac{1}{2}$; (iii) as in the case of orbital angular momentum $M = (M_x, M_y, M_z)$ one also has spatial quantization of $S$: The $z$-component of $S$ can assume only the two values $m_s\hbar = \pm\hbar/2$ (and, in particular, it cannot assume the value 0), which is in accord with the Stern-Gerlach experiment; recall Figure 5.4 and see Figure 9.1 below where

\[ \cos\theta = \frac{m_s}{\sqrt{s(s+1)}} = \pm\frac{1}{\sqrt{3}} \;\Longrightarrow\; 2 \text{ orientations: } \theta = 54^\circ 45',\ 125^\circ 15'. \]

$m_s$ is the electron's magnetic spin quantum number.

Starting with the commutation relations for orbital angular momentum

\[ [M_x, M_y] = i\hbar M_z, \quad [M_y, M_z] = i\hbar M_x, \quad [M_z, M_x] = i\hbar M_y \tag{9.1.1} \]

(see Appendix 5A) and with the eigenvalues $l(l+1)\hbar^2$, $m_l\hbar$ of $M^2 = M_x^2 + M_y^2 + M_z^2$, $M_z$ respectively (recalling that $M^2\psi_{nlm} = l(l+1)\hbar^2\psi_{nlm}$, $M_z\psi_{nlm} = m\hbar\psi_{nlm}$ by Theorems 5.2 and 5.3 where we write $m = m_l$) we seek to construct, analogously, mathematical spin operators $S_x, S_y, S_z$ subject to the commutation relations

\[ [S_x, S_y] = i\hbar S_z, \quad [S_y, S_z] = i\hbar S_x, \quad [S_z, S_x] = i\hbar S_y \tag{9.1.2} \]

such that $S^2 = S_x^2 + S_y^2 + S_z^2$ and $S_z$ have eigenvalues $s(s+1)\hbar^2$, $m_s\hbar$ respectively. Whereas $l$, $m_l$ are integers with $-l \le m_l \le l$, $l \ge 0$ we will want to have, for physical reasons, that in fact $s = 1/2$ and $m_s = \pm 1/2$, and that $S^2$ has the single eigenvalue $\frac{1}{2}(\frac{1}{2} + 1)\hbar^2$; $S_z$ will have $\pm\hbar/2$ only as eigenvalues. The construction of a triple $S = (S_x, S_y, S_z)$ which satisfies these requirements is a rather easy task. For this we make use of the Pauli spin matrices



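\[ \sigma_1 \overset{\text{def}}{=} \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad \sigma_2 \overset{\text{def}}{=} \begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}, \quad \sigma_3 \overset{\text{def}}{=} \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}. \tag{9.1.3} \]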

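Let $h(n)$ denote the set of Hermitian matrices of degree $n$: $A \in h(n) \iff A$ is a complex $n \times n$ matrix such that $A = A^* \overset{\text{def}}{=} \bar{A}^t$. Consider the case $n = 2$. For $A = \begin{bmatrix} a & b \\ c & d \end{bmatrix}$, $A^* = \begin{bmatrix} \bar{a} & \bar{c} \\ \bar{b} & \bar{d} \end{bmatrix} \Rightarrow A \in h(2) \iff a, d \in \mathbb{R}$ and $c = \bar{b}$. Writing $A = \begin{bmatrix} a & b_1 + ib_2 \\ b_1 - ib_2 & d \end{bmatrix}$, $a, b_1, b_2, d \in \mathbb{R}$ we see that for $1 = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$

\[ A = \frac{a + d}{2}\,1 + b_1\sigma_1 + (-b_2)\sigma_2 + \frac{a - d}{2}\,\sigma_3, \tag{9.1.4} \]

where of course $1, \sigma_1, \sigma_2, \sigma_3 \in h(2)$. By (9.1.4) we see that $h(2)$ is a 4-dimensional vector space with $\mathbb{R}$-basis $1, \sigma_1, \sigma_2, \sigma_3$. One easily checks that

\[ \sigma_1^2 = \sigma_2^2 = \sigma_3^2 = 1, \quad \sigma_1\sigma_2 = -\sigma_2\sigma_1 = i\sigma_3, \quad \sigma_2\sigma_3 = -\sigma_3\sigma_2 = i\sigma_1, \quad \sigma_3\sigma_1 = -\sigma_1\sigma_3 = i\sigma_2. \tag{9.1.5} \]

Define $S_x, S_y, S_z \in h(2)$ by

\[ S_x = \frac{\hbar}{2}\sigma_1, \quad S_y = \frac{\hbar}{2}\sigma_2, \quad S_z = \frac{\hbar}{2}\sigma_3. \tag{9.1.6} \]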
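Then by (9.1.5) the $S_x, S_y, S_z$ indeed do satisfy the commutation relations

\[ [S_x, S_y] = i\hbar S_z, \quad [S_y, S_z] = i\hbar S_x, \quad [S_z, S_x] = i\hbar S_y \tag{9.1.7} \]

postulated in (9.1.2). For example $[S_x, S_y] = \frac{\hbar^2}{4}(\sigma_1\sigma_2 - \sigma_2\sigma_1) = \frac{\hbar^2}{4}2\sigma_1\sigma_2 = \frac{\hbar^2}{2}i\sigma_3 = i\hbar S_z$. As $S_z = \frac{\hbar}{2}\begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}$, its eigenvalues are exactly $\pm\hbar/2$. Now $S^2 = S_x^2 + S_y^2 + S_z^2 = \frac{\hbar^2}{4}(\sigma_1^2 + \sigma_2^2 + \sigma_3^2) = \frac{3\hbar^2}{4}1$ by (9.1.5), so that indeed $S^2$ has the single eigenvalue $\frac{3}{4}\hbar^2 = \frac{1}{2}(\frac{1}{2} + 1)\hbar^2$, as desired. We thus take $\frac{\sqrt{3}}{2}\hbar$ as the magnitude of $S$.

The eigenvectors $\chi_\pm$ corresponding to the eigenvalues $m_s\hbar = \pm\frac{\hbar}{2}$ of $S_z$ are given by

\[ \chi_+ \overset{\text{def}}{=} \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad \chi_- \overset{\text{def}}{=} \begin{bmatrix} 0 \\ 1 \end{bmatrix}; \qquad S_z\chi_\pm = \pm\frac{\hbar}{2}\chi_\pm. \tag{9.1.8} \]

We illustrate the spatial quantization of $S$ as follows. Let $\theta$ denote the angle between the "vectors" $S_z$, $S$ measured with respect to the $z$-axis indicated as in Figure 9.1. Note that

\[ S_x = \begin{bmatrix} 0 & \hbar/2 \\ \hbar/2 & 0 \end{bmatrix}, \quad S_y = \begin{bmatrix} 0 & -i\hbar/2 \\ i\hbar/2 & 0 \end{bmatrix} \;\Longrightarrow\; S_x\begin{bmatrix} 1 \\ \pm 1 \end{bmatrix} = \pm\frac{\hbar}{2}\begin{bmatrix} 1 \\ \pm 1 \end{bmatrix}, \quad S_y\begin{bmatrix} 1 \\ \pm i \end{bmatrix} = \pm\frac{\hbar}{2}\begin{bmatrix} 1 \\ \pm i \end{bmatrix}. \tag{9.1.9} \]

That is, $\pm\hbar/2$ are also the eigenvalues of $S_x$, $S_y$. By general principles (principles which we have already employed; see (3.9.1)), the only possible values a physical observable can attain are the eigenvalues of the corresponding quantum operator assigned to that observable. We conclude therefore that the $x$, $y$, or $z$-component of the spin (of any 1/2-spin particle) can assume only the two values $\pm\hbar/2$, the relevant Hilbert space being $H = \mathbb{C}^2$.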
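
The algebra behind (9.1.5) through (9.1.7) is easy to confirm by direct $2 \times 2$ matrix multiplication. The following Python sketch does so with plain nested lists; the helper names (`matmul`, `commutator`, and so on) are hypothetical, and units with $\hbar = 1$ are assumed purely for convenience.

```python
HBAR = 1.0  # assumption: units in which hbar = 1


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def commutator(a, b):
    return add(matmul(a, b), scale(-1, matmul(b, a)))


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


identity = [[1, 0], [0, 1]]
sigma1 = [[0, 1], [1, 0]]
sigma2 = [[0, -1j], [1j, 0]]
sigma3 = [[1, 0], [0, -1]]
Sx, Sy, Sz = (scale(HBAR / 2, s) for s in (sigma1, sigma2, sigma3))

# S^2 = Sx^2 + Sy^2 + Sz^2
S_squared = add(add(matmul(Sx, Sx), matmul(Sy, Sy)), matmul(Sz, Sz))

checks = {
    "[Sx,Sy] = i hbar Sz": close(commutator(Sx, Sy), scale(1j * HBAR, Sz)),
    "[Sy,Sz] = i hbar Sx": close(commutator(Sy, Sz), scale(1j * HBAR, Sx)),
    "[Sz,Sx] = i hbar Sy": close(commutator(Sz, Sx), scale(1j * HBAR, Sy)),
    "S^2 = (3/4) hbar^2 1": close(S_squared, scale(0.75 * HBAR ** 2, identity)),
}
print(checks)
```

Each entry of `checks` should evaluate to `True`, matching (9.1.7) and the single eigenvalue $\frac{3}{4}\hbar^2$ of $S^2$.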




Figure 9.1: Spatial quantization of S 
9.2 Pauli Spin Wave Functions

Taking into account various experimental results (such as alluded to earlier) which 
could not be explained by “prespin” quantum mechanics, Wolfgang Pauli in 1925 
postulated the existence of two distinct states or “orientations” of an electron. 
We represent these states mathematically by the eigenvectors $\chi_\pm$ (9.1.8). $\chi_+$ is called the spin-up state and $\chi_-$ is called the spin-down state.

When the notion of spin is taken into account, one has in place of Schrödinger wave functions more general Pauli spin wave functions (also due to C.G. Darwin)

\[ \psi_P(x, y, z, t) = \begin{bmatrix} \psi_+(x, y, z, t) \\ \psi_-(x, y, z, t) \end{bmatrix} = \psi_+(x, y, z, t)\chi_+ + \psi_-(x, y, z, t)\chi_- \tag{9.2.1} \]

satisfying the Pauli equation

\[ i\hbar\frac{\partial\psi_P}{\partial t} = H\psi_P \tag{9.2.2} \]

for a suitable Hamiltonian $H$.

In general, the state of a $\frac{1}{2}$-spin particle (electron, proton, neutron, quark, neutrino, or muon, for example) is a spinor

\[ \chi = \begin{bmatrix} z_1 \\ z_2 \end{bmatrix} = z_1\chi_+ + z_2\chi_- \in \mathbb{C}^2, \tag{9.2.3} \]

$\mathbb{C}^2$ being the Hilbert space with standard inner product structure

\[ \left\langle \begin{bmatrix} z_1 \\ z_2 \end{bmatrix}, \begin{bmatrix} w_1 \\ w_2 \end{bmatrix} \right\rangle = z_1\bar{w}_1 + z_2\bar{w}_2. \tag{9.2.4} \]

Hence

\[ \langle\chi_+, \chi_+\rangle = \langle\chi_-, \chi_-\rangle = 1, \quad \langle\chi_+, \chi_-\rangle = 0, \tag{9.2.5} \]

which means that normalized states ($\chi$ with $\|\chi\| = 1$) are given by

\[ |z_1|^2 + |z_2|^2 = 1. \tag{9.2.6} \]

We concluded earlier that the only values the spin components $S_x, S_y, S_z$ can assume are the eigenvalues $\pm\hbar/2$. A state has the form

\[ \chi = z_1\chi_+ + z_2\chi_-, \tag{9.2.7} \]

where (by (9.1.8)) $\chi_\pm$ are normalized eigenspinors of $S_z$ with eigenvalues $\pm\hbar/2$, equation (9.2.7) being the analogue of the "Fourier expansion" of $\Psi(\ ; t)$ in terms of the eigenvalues of $H_0$ in (7.7.23). We therefore conclude the following for a normalized state: If a measurement of $S_z$ were performed, the probability that the value $\hbar/2$ would be obtained is $|z_1|^2$; similarly $|z_2|^2$ is the probability that the value $-\hbar/2$ would be obtained. Compare the remarks made in the discussion of equation (7.7.23), where now $\chi_\pm$ play the role of the stationary states (with time factors $\exp(-iE_jt/\hbar)$) there and $S_z$ plays the role of $H_0$. Also compare the remarks that followed equation (3.9.10).

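As an example, take a $\frac{1}{2}$-spin particle in the normalized state

\[ \chi = \frac{1}{\sqrt{62}}\begin{bmatrix} -3 \\ 2 + 7i \end{bmatrix}. \tag{9.2.8} \]

Then $|z_1|^2 = 9/62$ is the probability that a measurement of $S_z$ will yield the value $\hbar/2$, and $|z_2|^2 = 53/62$ is the probability that a measurement of $S_z$ will yield the value $-\hbar/2$.

To find the probability that a measurement of $S_x$, or of $S_y$, will yield the values $\hbar/2$ or $-\hbar/2$ we Fourier expand a state $\chi$ in terms of the normalized eigenspinors $\frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ 1 \end{bmatrix}$, $\frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ -1 \end{bmatrix}$ $\left(\frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ i \end{bmatrix}, \frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ -i \end{bmatrix}\right)$ for $S_x$ ($S_y$) respectively, similarly as we did in (9.2.7); see (9.1.9). For example, for $S_x$

\[ \chi = \begin{bmatrix} z_1 \\ z_2 \end{bmatrix} = \frac{(z_1 + z_2)}{\sqrt{2}}\begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix} + \frac{(z_1 - z_2)}{\sqrt{2}}\begin{bmatrix} 1/\sqrt{2} \\ -1/\sqrt{2} \end{bmatrix}, \tag{9.2.9} \]

where, similar to (9.2.5),

\[ \|\chi\|^2 = \frac{|z_1 + z_2|^2}{2} + \frac{|z_1 - z_2|^2}{2} = |z_1|^2 + |z_2|^2. \tag{9.2.10} \]

If $\chi$ is normalized, then $\frac{|z_1 \pm z_2|^2}{2}$ is the probability, therefore, that a measurement of $S_x$ will yield the value $\pm\hbar/2$. For the state $\chi$ in (9.2.8),

\[ \chi = \frac{-1 + 7i}{2\sqrt{31}}\begin{bmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{bmatrix} + \frac{-5 - 7i}{2\sqrt{31}}\begin{bmatrix} 1/\sqrt{2} \\ -1/\sqrt{2} \end{bmatrix}, \qquad \sqrt{2}\sqrt{62} = 2\sqrt{31}. \tag{9.2.11} \]

That is, the probability that a measurement of $S_x$ (of a 1/2-spin particle in the state $\chi = \frac{1}{\sqrt{62}}\begin{bmatrix} -3 \\ 2 + 7i \end{bmatrix}$) will yield the value $\hbar/2$ is 50/124, and the probability that such a measurement will yield the value $-\hbar/2$ is 74/124.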
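
The probabilities in this example can be verified with exact rational arithmetic. The sketch below is only an illustration of (9.2.6) and (9.2.10) for the state (9.2.8); the helper `abs2` assumes Gaussian-integer components, which holds for the unnormalized components $-3$ and $2 + 7i$ used here.

```python
from fractions import Fraction


def abs2(z):
    """|z|^2 for a complex number with integer real and imaginary parts."""
    z = complex(z)
    return int(z.real) ** 2 + int(z.imag) ** 2


# Unnormalized components of the state (9.2.8); the norm squared is 62.
z1, z2 = -3, 2 + 7j
norm2 = abs2(z1) + abs2(z2)

# S_z probabilities: |z1|^2 and |z2|^2 after normalization
p_z_up = Fraction(abs2(z1), norm2)
p_z_down = Fraction(abs2(z2), norm2)

# S_x probabilities: |z1 + z2|^2 / 2 and |z1 - z2|^2 / 2 after normalization
p_x_up = Fraction(abs2(z1 + z2), 2 * norm2)
p_x_down = Fraction(abs2(z1 - z2), 2 * norm2)

print(p_z_up, p_z_down, p_x_up, p_x_down)
```

`Fraction` reduces 50/124 and 74/124 to 25/62 and 37/62; in each basis the two probabilities sum to 1, as (9.2.10) requires.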

Let an electron in the presence of a magnetic field $B = (B_1, B_2, B_3) : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ (where each $B_j : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}$) be situated at some fixed point in $\mathbb{R}^3$. For this system Pauli's equation (9.2.2) assumes the form

\[ i\hbar\frac{d}{dt}\begin{bmatrix} \psi_+(t) \\ \psi_-(t) \end{bmatrix} = H\begin{bmatrix} \psi_+(t) \\ \psi_-(t) \end{bmatrix}, \tag{9.2.12} \]

since the electron is at rest. Pauli postulated that in this case the Hamiltonian $H$ is given by

\[ H = H_B \overset{\text{def}}{=} \frac{e}{m_ec}\,S \cdot B, \tag{9.2.13} \]

i.e.

\[ H_B = \mu(B_1\sigma_1 + B_2\sigma_2 + B_3\sigma_3) \tag{9.2.14} \]

\[ = \mu\begin{bmatrix} B_3 & B_1 - iB_2 \\ B_1 + iB_2 & -B_3 \end{bmatrix} \quad \text{for } \mu \overset{\text{def}}{=} \frac{e\hbar}{2m_ec} \tag{9.2.15} \]

by (9.1.3), where $m_e$ is the electron mass. The interesting factor $\mu = \frac{e\hbar}{2m_ec}$ in (9.2.15) (called the Bohr magneton; see (0.0.13), (0.0.15), in particular the ratio $\frac{e}{2m_ec}$) deserves some discussion. In Appendix 9A to this chapter, we provide some indication of how such a ratio (called a gyromagnetic ratio: that of a charge to twice the mass of a particle times the speed of light) arises in connection with magnetic dipole moments, and we indicate why $H_B$ is defined by (9.2.13). Later, in equation (9.3.14), it is indicated how a weak magnetic field produces an energy level shift proportional to the amount $\mu$. Recall that $\mu$ also denotes reduced mass. This dual use of notation will not cause any problems.

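Take for example $B$ to be a constant magnetic field along the $z$-axis: $B = (0, 0, B_3)$ as in (8.1.6), where $B_3$ is a constant. Write $B$ for $B_3$. Then as

\[ H_B = \begin{bmatrix} \mu B & 0 \\ 0 & -\mu B \end{bmatrix}, \tag{9.2.16} \]

equation (9.2.12) becomes the system

\[ i\hbar\psi_\pm'(t) = \pm\mu B\psi_\pm(t) \tag{9.2.17} \]

with solutions

\[ \psi_\pm(t) = \psi_\pm(0)\exp\left(\mp\frac{i\mu Bt}{\hbar}\right). \tag{9.2.18} \]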

Example 

Instead of a constant magnetic field, consider as a second example an electron at rest in a time-dependent oscillatory field given by

\[ B(x, y, z, t) = (0, 0, B\cos\omega t) \tag{9.2.19} \]

for $B, \omega \in \mathbb{R}$ with $\omega \neq 0$. $H_B$ in (9.2.15) is given by

\[ H_B = \begin{bmatrix} \mu B\cos\omega t & 0 \\ 0 & -\mu B\cos\omega t \end{bmatrix} \tag{9.2.20} \]

and the corresponding Pauli equation (9.2.12) is equivalent to the differential equations

\[ i\hbar\psi_\pm'(t) = \pm\mu B(\cos\omega t)\psi_\pm(t), \tag{9.2.21} \]

which are separable and which therefore can be immediately solved: For constants $C_\pm$

\[ \psi_\pm(t) = C_\pm\exp\left(\mp\frac{i\mu B}{\hbar\omega}\sin\omega t\right). \tag{9.2.22} \]

The norm $\|\psi(t)\|$ of the state

\[ \psi(t) \overset{\text{def}}{=} \begin{bmatrix} \psi_+(t) \\ \psi_-(t) \end{bmatrix} \tag{9.2.23} \]

is independent of $t$:

\[ \|\psi(t)\|^2 = |\psi_+(t)|^2 + |\psi_-(t)|^2 = |C_+|^2 + |C_-|^2; \tag{9.2.24} \]

cf. (9.2.4). In particular, for any $t$, $\psi(t)$ is normalized $\iff |C_+|^2 + |C_-|^2 = 1$. Note that by (9.2.22), (9.2.23)

\[ \psi(0) = \begin{bmatrix} C_+ \\ C_- \end{bmatrix}. \tag{9.2.25} \]

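As an application of (9.2.22) suppose the electron (at rest in the oscillating magnetic field $B$ given by (9.2.19)) at time $t = 0$ is in the spin-up state with respect to the $x$-axis, i.e., suppose $\psi(0) = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ 1 \end{bmatrix}$; compare remarks preceding (9.2.9). What is the probability $P(t)$ that a measurement of the $x$-component of its spin at time $t$ will yield the value $-\hbar/2$? To answer this question we note that first of all the electron initially (and therefore at all times, by (9.2.24)) is in a normalized state:

\[ \psi(0) = \begin{bmatrix} C_+ \\ C_- \end{bmatrix} \ (\text{by (9.2.25)}) = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 \\ 1 \end{bmatrix} \;\Longrightarrow\; |C_+|^2 + |C_-|^2 = 1. \]

Therefore, by remarks preceding (9.2.11), $P(t)$ is given by

\[ P(t) = \frac{|z_1(t) - z_2(t)|^2}{2} \quad \text{where } \psi(t) = \begin{bmatrix} z_1(t) \\ z_2(t) \end{bmatrix}. \]

Let $\lambda = \frac{\mu B}{\hbar\omega}$. Then by (9.2.22) (since $C_+ = C_- = \frac{1}{\sqrt{2}}$)

\[ \begin{bmatrix} z_1(t) \\ z_2(t) \end{bmatrix} = \begin{bmatrix} \psi_+(t) \\ \psi_-(t) \end{bmatrix} = \frac{1}{\sqrt{2}}\begin{bmatrix} e^{-i\lambda\sin\omega t} \\ e^{i\lambda\sin\omega t} \end{bmatrix}, \tag{9.2.26} \]

i.e.,

\[ z_1(t) - z_2(t) = \frac{1}{\sqrt{2}}\left(e^{-i\lambda\sin\omega t} - e^{i\lambda\sin\omega t}\right) = \frac{-2i}{\sqrt{2}}\sin(\lambda\sin\omega t) \;\Longrightarrow\; |z_1(t) - z_2(t)|^2 = 2\sin^2(\lambda\sin\omega t). \tag{9.2.27} \]

That is, the desired probability $P(t)$ at time $t$ is given by

\[ P(t) = \sin^2\left(\frac{\mu B}{\hbar\omega}\sin\omega t\right) = \sin^2\left(\frac{eB}{2m_ec\omega}\sin\omega t\right) \tag{9.2.28} \]

by (9.2.15).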
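
The closed form (9.2.28) can be compared against a direct numerical integration of (9.2.21). The sketch below is a hedged illustration, not part of the original derivation: it integrates each component with a classical fourth-order Runge-Kutta step, using the dimensionless parameter $\lambda = \mu B/(\hbar\omega)$, and the sample values of `lam`, `omega` and the times are arbitrary choices.

```python
import math


def closed_form_probability(t, lam, omega):
    """P(t) = sin^2(lam * sin(omega t)), as in (9.2.28)."""
    return math.sin(lam * math.sin(omega * t)) ** 2


def integrate_component(sign, lam, omega, t_end, steps=2000):
    """RK4 for psi' = -i*sign*lam*omega*cos(omega t)*psi, psi(0) = 1/sqrt(2).

    This is (9.2.21) divided by i*hbar, using mu*B/hbar = lam*omega.
    """
    def rhs(t, psi):
        return -1j * sign * lam * omega * math.cos(omega * t) * psi

    h = t_end / steps
    t, psi = 0.0, complex(1 / math.sqrt(2))
    for _ in range(steps):
        k1 = rhs(t, psi)
        k2 = rhs(t + h / 2, psi + h / 2 * k1)
        k3 = rhs(t + h / 2, psi + h / 2 * k2)
        k4 = rhs(t + h, psi + h * k3)
        psi += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return psi


def numerical_probability(t, lam, omega):
    z1 = integrate_component(+1, lam, omega, t)   # psi_+ (spin up along z)
    z2 = integrate_component(-1, lam, omega, t)   # psi_- (spin down along z)
    return abs(z1 - z2) ** 2 / 2                  # probability of S_x = -hbar/2


lam, omega = 0.8, 2.0          # illustrative values only
times = [0.25, 0.7, 1.3]
max_error = max(
    abs(numerical_probability(t, lam, omega) - closed_form_probability(t, lam, omega))
    for t in times
)
print(max_error)
```

The printed discrepancy should be tiny, which supports both the solution formula (9.2.22) and the probability (9.2.28).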
9.3 The Zeeman Effect for Weak Magnetic Fields



When an atom is placed in a magnetic field, the effect is that it experiences a shift 
of its energy levels. As indicated earlier, one observes a splitting of its spectral 
lines. This effect, called the Zeeman effect, was discovered by Charles Fievez 
around 1885, and ten years later by Pieter Zeeman. The field could be weak, 
intermediate, or strong. We shall confine our discussion to weak magnetic fields. 
One could also consider shifted energy levels of an atom placed in an electric 
field, the so-called Stark effect, after Johannes Stark. 

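As before we consider a constant magnetic field $B \overset{\text{def}}{=} (0, 0, B)$, $B \in \mathbb{R}$, along the $z$-axis. For such a field we can choose a vector potential $A : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ by $A(x, y, z, t) = \left(-\frac{By}{2}, \frac{Bx}{2}, 0\right)$ : $B = \text{curl}\, A$. Recall (8.1.7) and also formula (8.2.9): The quantized Hamiltonian $H$ of a particle with charge $q$ and mass $m$ under the influence of the electromagnetic field $(E, B)$ is given by

\[ H = -\frac{\hbar^2}{2m}\nabla^2 - \frac{qB}{2mc}M_z + \frac{q^2}{2mc^2}\left(A_1^2 + A_2^2 + A_3^2\right) + q\phi, \tag{9.3.1} \]

where $A_1^2 + A_2^2 + A_3^2 = \frac{B^2}{4}(x^2 + y^2)$, $M_z = \frac{\hbar}{i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right)$ is the $z$-component of orbital angular momentum, and where $\phi$ is a scalar potential for $E$: $E = -\nabla\phi - \frac{1}{c}\frac{\partial A}{\partial t} = 0$ for the choice $\phi = 0$.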

To treat the case of a hydrogenic atom influenced by $B$, with a single electron of charge $q = -e$ and nucleus of charge $Ze$, we must consider (in addition to $H$) a Coulomb potential energy contribution $V$ of the nucleus. To simplify matters we take the nuclear position to be fixed at the origin of the coordinates. Then away from the origin

\[ V(x, y, z) = \frac{-Ze^2}{\sqrt{x^2 + y^2 + z^2}}, \tag{9.3.2} \]

which gives the electrostatic Coulomb force

\[ F(x, y, z) = (-\nabla V)(x, y, z) = \frac{-Ze^2(x, y, z)}{|(x, y, z)|^3} \tag{9.3.3} \]

that the nucleus exerts on the electron; see Figure 5.1, whose force vector corresponds to $F$ here. To obtain Pauli's equation for our system we also add to $H$ Pauli's "magnetic" Hamiltonian $H_B$ given in (9.2.15). Thus we obtain the Hamiltonian

\[ \mathcal{H} = \begin{bmatrix} H + V & 0 \\ 0 & H + V \end{bmatrix} + H_B \tag{9.3.4} \]

and the corresponding Pauli equation

\[ i\hbar\frac{\partial\psi_P}{\partial t} = \mathcal{H}\psi_P, \tag{9.3.5} \]

for which we seek solutions; see (9.2.2). We should point out that one further term could be added to $\mathcal{H}$, a term which takes into account so-called spin-orbit interaction, which we neglect here. Finally we assume that the field $B$ is weak, since in this case it is well known, empirically, that the quadratic contribution $A_1^2 + A_2^2 + A_3^2$ to $H$ in (9.3.1) can be disregarded in many practical situations. Remembering the choice $\phi = 0$ for the scalar potential we therefore take

\[ H = -\frac{\hbar^2}{2m_e}\nabla^2 + \frac{eBM_z}{2m_ec} \tag{9.3.6} \]



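in (9.3.1). By (9.3.4) equation (9.3.5) becomes the system

\[ i\hbar\frac{\partial\psi_\pm}{\partial t}(x, y, z, t) = \left[-\frac{\hbar^2}{2m_e}\nabla^2 - \frac{Ze^2}{\sqrt{x^2 + y^2 + z^2}} + \frac{eB}{2m_ec}(M_z \pm \hbar)\right]\psi_\pm(x, y, z, t) \tag{9.3.7} \]

on $(\mathbb{R}^3 - \{(0, 0, 0)\}) \times \mathbb{R}$; see (9.2.1), (9.2.15), (9.2.16). If we have stationary state solutions $\psi_\pm(x, y, z)$, i.e.,

\[ (H\psi_\pm)(x, y, z) = E\psi_\pm(x, y, z), \quad E \in \mathbb{R}, \tag{9.3.8} \]

where $H$ now denotes the operator on the right-hand side of (9.3.7), then the equation

\[ \psi_\pm(x, y, z, t) \overset{\text{def}}{=} \psi_\pm(x, y, z)\exp\left(\frac{-iEt}{\hbar}\right) \tag{9.3.9} \]

provides for solutions $\psi_\pm$ of (9.3.7). Accordingly, we look for solutions of (9.3.8):

\[ \left[-\frac{\hbar^2}{2m_e}\nabla^2 - \frac{Ze^2}{\sqrt{x^2 + y^2 + z^2}} + \frac{eB}{2m_ec}(M_z \pm \hbar)\right]\psi_\pm = E\psi_\pm \tag{9.3.10} \]

on $\mathbb{R}^3 - \{(0, 0, 0)\}$ which may be written

\[ (\nabla^2\psi_\pm)(x, y, z) + \frac{2m_e}{\hbar^2}\left[\frac{Ze^2}{\sqrt{x^2 + y^2 + z^2}} + E - \frac{eB}{2m_ec}(M_z \pm \hbar)\right]\psi_\pm(x, y, z) = 0 \tag{9.3.11} \]

on $\mathbb{R}^3 - \{(0, 0, 0)\}$. The point is that, modulo the term $-\frac{eB}{2m_ec}(M_z \pm \hbar)$, equation (9.3.11) is the first equation in (5.1.9), where the reduced mass $\mu$ there is replaced by $m_e$ here. Now since the $\psi_{nlm}$ in (5.2.12) are normalized solutions of the first equation in (5.1.9) for

\[ E = E^{(n)} \overset{\text{def}}{=} \frac{-m_eZ^2e^4}{2\hbar^2n^2} \tag{9.3.12} \]

(and with the corresponding constant $\frac{2m_eZe^2}{\hbar^2n}$) in (5.1.29), (5.1.30), and since

\[ \frac{-eB}{2m_ec}(M_z \pm \hbar)\psi_{nlm} = \frac{-eB}{2m_ec}(m\hbar \pm \hbar)\psi_{nlm} \tag{9.3.13} \]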
by Theorem 5.3, it follows that the normalized hydrogenic wave functions $\psi_{nlm}$ are actually solutions of (9.3.11) for

\[ E = E^{(n)} + \frac{eB\hbar}{2m_ec}(m \pm 1). \tag{9.3.14} \]

This establishes the normal Zeeman effect for weak magnetic fields for the case at hand, where we see in fact that the field produces a shift in the energy level $E^{(n)}$ by the amount $\frac{eB\hbar}{2m_ec}(m \pm 1)$, $m = m_l$ being the magnetic quantum number of the hydrogenic atom under consideration in the state $\psi_{nlm}$. Now we see why $m$ is called the magnetic quantum number. We also see, by (9.3.9), that we can obtain Pauli-Darwin spin wave functions $\psi_P$ which solve equation (9.3.7) by setting

\[ \psi_P(x, y, z, t) = \begin{bmatrix} \psi_{nlm}(x, y, z)\exp\left(-iE^+t/\hbar\right) \\ \psi_{nlm}(x, y, z)\exp\left(-iE^-t/\hbar\right) \end{bmatrix} \tag{9.3.15} \]

on $(\mathbb{R}^3 - \{(0, 0, 0)\}) \times \mathbb{R}$, for $E^\pm$ given by (9.3.14).

For $n, l$ fixed, with the field $B$ "turned on," the spin-up states have $2l + 1$ energy values $E^+_m = E^{(n)} + \frac{eB\hbar}{2m_ec}(m + 1)$, $m = 0, \pm 1, \ldots, \pm l$. Similarly the spin-down states have $2l + 1$ energy values $E^-_m = E^{(n)} + \frac{eB\hbar}{2m_ec}(m - 1)$, $m = 0, \pm 1, \ldots, \pm l$. Note that for $l \neq 0$ there are $2l + 3$ distinct values $0, \pm 1, \pm 2, \ldots, \pm(l + 1)$ of the $m \pm 1$ (and 2 distinct values $\pm 1$, of course, if $l = 0$). With $B$ turned on we have a Zeeman splitting of energy levels, illustrated for example for the states defined by $(n, l) = (1, 0), (2, 1)$ in Figure 9.2.



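Figure 9.2: Zeeman splitting in a magnetic field. (Columns: $B$ absent, $B$ turned on, $m$, spin. The level $(n, l) = (1, 0)$, with $m = 0$, splits into two levels: $m = 0$ spin up and $m = 0$ spin down. The level $(n, l) = (2, 1)$, with $m = 0, \pm 1$, splits into five levels, listed from highest to lowest: $m = 1$ up; $m = 0$ up; $m = -1$ up together with $m = 1$ down; $m = 0$ down; $m = -1$ down.)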
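
The counting of distinct shifted levels can be checked by enumeration. In the following Python sketch the function name `zeeman_shift_multiples` is hypothetical; it lists the distinct integers $m \pm 1$ that multiply $\frac{eB\hbar}{2m_ec}$ in (9.3.14) for a given $l$.

```python
def zeeman_shift_multiples(l):
    """Distinct values of m + 1 (spin up) and m - 1 (spin down) for m = -l, ..., l."""
    up = {m + 1 for m in range(-l, l + 1)}
    down = {m - 1 for m in range(-l, l + 1)}
    return sorted(up | down)


for l in range(4):
    multiples = zeeman_shift_multiples(l)
    print(l, multiples, len(multiples))
```

For $l = 0$ only the two values $\pm 1$ occur, while for $l = 1$ one obtains $-2, -1, 0, 1, 2$, the five levels of Figure 9.2; in general there are $2l + 3$ values when $l \geq 1$.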
Example

As another example of the shift of energy levels by an imposed magnetic field $B$ (the Zeeman effect) we consider a particle of charge $q$ with the field constant as before along the $z$-axis. By (8.1.8) another choice of vector potential $A : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{R}^3$ for $B = (0, 0, B)$ is given by $A(x, y, z, t) = (-By, 0, 0)$, in which case the quantized Hamiltonian $H$ is given by formula (8.2.10):

\[ H = -\frac{\hbar^2}{2m}\nabla^2 + \frac{q\hbar}{mci}By\frac{\partial}{\partial x} + \frac{q^2B^2y^2}{2mc^2} + q\phi, \tag{9.3.16} \]

where $m$ is the mass of the particle; we take the scalar potential $\phi = 0$. The present example is simpler than the preceding one of an electron bound to a nucleus. Here no Coulomb potential need be considered. We will consider moreover only Schrödinger's equation

\[ i\hbar\frac{\partial\psi}{\partial t} = H\psi \tag{9.3.17} \]

where $\psi : \mathbb{R}^3 \times \mathbb{R} \to \mathbb{C}$. As usual it is enough to obtain solutions $\psi$ of

\[ H\psi = E\psi, \quad E \in \mathbb{R} \tag{9.3.18} \]

and set

\[ \psi(x, y, z, t) = \psi(x, y, z)e^{-iEt/\hbar} \tag{9.3.19} \]

to obtain solutions of (9.3.17).
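To find solutions of (9.3.18) propose a separation of variables of the form

\[ \psi(x, y, z) = e^{i(Ax + Cz)}\psi_0(y), \tag{9.3.20} \]

where $A, C \in \mathbb{R}$. Then equation (9.3.18) reduces to

\[ -\frac{\hbar^2}{2m}\left[(-A^2 - C^2)\psi_0(y) + \psi_0''(y)\right] + \frac{AqB\hbar}{mc}y\psi_0(y) + \frac{q^2B^2y^2}{2mc^2}\psi_0(y) = E\psi_0(y) \tag{9.3.21} \]

or

\[ \left[-\frac{\hbar^2}{2m}\frac{d^2}{dy^2} + \frac{q^2B^2y^2}{2mc^2} + \frac{A\hbar qBy}{mc}\right]\psi_0(y) = \left[E - \frac{\hbar^2}{2m}(A^2 + C^2)\right]\psi_0(y). \tag{9.3.22} \]

The trick for solving (9.3.22) is to use a change of variables. Namely, let

\[ \psi(y) \overset{\text{def}}{=} \psi_0\left(y - \frac{\hbar A}{m\omega}\right), \quad \text{where } \omega \overset{\text{def}}{=} \frac{qB}{mc}, \tag{9.3.23} \]

as in the second example of Chapter 8. Then

\[ \hbar cAm\omega = \hbar AqB \;\Longrightarrow\; \frac{qB\hbar A}{mc}\cdot\frac{\hbar A}{m\omega} = \frac{\hbar^2A^2}{m}. \tag{9.3.24} \]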
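Now

\[ \begin{aligned}
-\frac{\hbar^2}{2m}\psi''(y) + \frac{m}{2}\omega^2y^2\psi(y)
&\overset{\text{def}}{=} -\frac{\hbar^2}{2m}\psi_0''\left(y - \frac{\hbar A}{m\omega}\right) + \frac{m}{2}\omega^2\left[\left(y - \frac{\hbar A}{m\omega}\right) + \frac{\hbar A}{m\omega}\right]^2\psi_0\left(y - \frac{\hbar A}{m\omega}\right) \\
&= -\frac{\hbar^2}{2m}\psi_0''\left(y - \frac{\hbar A}{m\omega}\right) + \left[\frac{m}{2}\omega^2\left(y - \frac{\hbar A}{m\omega}\right)^2 + m\omega^2\left(y - \frac{\hbar A}{m\omega}\right)\frac{\hbar A}{m\omega} + \frac{m}{2}\omega^2\left(\frac{\hbar A}{m\omega}\right)^2\right]\psi_0\left(y - \frac{\hbar A}{m\omega}\right) \\
&= -\frac{\hbar^2}{2m}\psi_0''\left(y - \frac{\hbar A}{m\omega}\right) + \left[\frac{m}{2}\omega^2\left(y - \frac{\hbar A}{m\omega}\right)^2 + \omega y\hbar A - \frac{m}{2}\omega^2\left(\frac{\hbar A}{m\omega}\right)^2\right]\psi_0\left(y - \frac{\hbar A}{m\omega}\right),
\end{aligned} \tag{9.3.25} \]

where

\[ \frac{m\omega^2}{2}\left(\frac{\hbar A}{m\omega}\right)^2 = \frac{\hbar^2A^2}{2m} = \frac{\hbar^2A^2}{m} - \frac{\hbar^2A^2}{2m} = \frac{qB\hbar A}{mc}\cdot\frac{\hbar A}{m\omega} - \frac{\hbar^2A^2}{2m}, \tag{9.3.26} \]

by (9.3.24), $\Longrightarrow$ (by definition of $\omega$)

\[ \omega y\hbar A - \frac{m\omega^2}{2}\left(\frac{\hbar A}{m\omega}\right)^2 = \frac{qB\hbar A}{mc}\left(y - \frac{\hbar A}{m\omega}\right) + \frac{\hbar^2A^2}{2m} \tag{9.3.27} \]

$\Longrightarrow$ by (9.3.25)

\[ \begin{aligned}
-\frac{\hbar^2}{2m}\psi''(y) + \frac{m}{2}\omega^2y^2\psi(y)
&= -\frac{\hbar^2}{2m}\psi_0''\left(y - \frac{\hbar A}{m\omega}\right) + \left[\frac{m}{2}\frac{q^2B^2}{m^2c^2}\left(y - \frac{\hbar A}{m\omega}\right)^2 + \frac{qB\hbar A}{mc}\left(y - \frac{\hbar A}{m\omega}\right) + \frac{\hbar^2A^2}{2m}\right]\psi_0\left(y - \frac{\hbar A}{m\omega}\right) \\
&= \left[E - \frac{\hbar^2}{2m}(A^2 + C^2)\right]\psi_0\left(y - \frac{\hbar A}{m\omega}\right) + \frac{\hbar^2A^2}{2m}\psi_0\left(y - \frac{\hbar A}{m\omega}\right) \quad (\text{by (9.3.22)}) \\
&= \left[E - \frac{\hbar^2}{2m}C^2\right]\psi_0\left(y - \frac{\hbar A}{m\omega}\right) = \left[E - \frac{\hbar^2}{2m}C^2\right]\psi(y)
\end{aligned} \tag{9.3.28} \]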
by definition (9.3.23)! That is, $\psi$ satisfies Schrödinger's time independent equation for the simple harmonic oscillator with angular speed $\omega$ and frequency $\nu = \omega/(2\pi)$; see equation (3.2.3). We therefore obtain the Landau energies $E$ given by

\[ E - \frac{\hbar^2}{2m}C^2 = \hbar\omega\left(n + \frac{1}{2}\right), \tag{9.3.29} \]



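$n = 0, 1, 2, 3, \ldots$, and corresponding normalized $L^2$-solutions $\psi = \psi_n$ given in Section 4.8 of Chapter 4. Namely

\[ \psi_n(y) = (-1)^n\left(\frac{b}{\pi}\right)^{1/4}\frac{1}{\sqrt{2^nn!}}\,e^{-by^2/2}H_n(\sqrt{b}\,y) \tag{9.3.30} \]

for

\[ b \overset{\text{def}}{=} \frac{2\pi\nu m}{\hbar} = \frac{\omega m}{\hbar} = \frac{qB}{\hbar c}, \quad n = 0, 1, 2, 3, \ldots, \]
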

where $H_n$ is the $n$th Hermite polynomial. Note that the Landau energies

\[ E = E_n(C) = \frac{\hbar qB}{mc}\left(n + \frac{1}{2}\right) + \frac{\hbar^2C^2}{2m} \tag{9.3.31} \]

depend on the parameter $C$ in (9.3.20) of course but are independent of the parameter $A$ there. Thus infinitely many choices of $A$ give rise to the same energy levels in (9.3.31), a phenomenon called infinite degeneracy in physics. By (9.3.20), (9.3.23) corresponding stationary Landau states $\psi_n$ (i.e., solutions of (9.3.18), i.e., of (9.3.21) or (9.3.22) for $E = E_n(C)$) are given by

\[ \psi_n(x, y, z) = e^{i(Ax + Cz)}\psi_n\left(y + \frac{\hbar A}{m\omega}\right), \quad n = 0, 1, 2, 3, \ldots \tag{9.3.32} \]

for $\psi_n$ given in (9.3.30). Finally one obtains by (9.3.19) time-dependent solutions $\psi_n$ of (9.3.17) (for $H$ given by (9.3.16) with $\phi = 0$ there) by setting

\[ \psi_n(x, y, z, t) = e^{i(Ax + Cz)}\psi_n\left(y + \frac{\hbar A}{m\omega}\right)e^{-iE_n(C)t/\hbar} \tag{9.3.33} \]

for $n = 0, 1, 2, 3, \ldots$, and for arbitrary choices of constants $A, C \in \mathbb{R}$.
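
The change of variables used above amounts to completing a square, and this can be tested numerically. The sketch below is an illustration under stated assumptions: units with $\hbar = 1$, arbitrary sample values for $A$, $C$, $m$, $\omega$, and the unnormalized $n = 0$ state $e^{-b(y + \hbar A/(m\omega))^2/2}$ with $b = m\omega/\hbar$. It checks that the potential terms of (9.3.22) equal a shifted oscillator potential minus $\hbar^2A^2/(2m)$, and that this state satisfies (9.3.22) with $E = E_0(C)$, using a central finite difference for the second derivative.

```python
import math

HBAR = 1.0  # assumption: units in which hbar = 1


def potential_9_3_22(y, A, m, omega):
    """q^2 B^2 y^2/(2 m c^2) + A hbar q B y/(m c), written with omega = qB/(mc)."""
    return 0.5 * m * omega ** 2 * y ** 2 + HBAR * A * omega * y


def completed_square(y, A, m, omega):
    shift = HBAR * A / (m * omega)
    return 0.5 * m * omega ** 2 * (y + shift) ** 2 - (HBAR * A) ** 2 / (2 * m)


def landau_energy(n, C, m, omega):
    """E_n(C) as in (9.3.31), with hbar*q*B/(m*c) = hbar*omega."""
    return HBAR * omega * (n + 0.5) + (HBAR * C) ** 2 / (2 * m)


def ground_state(y, A, m, omega):
    """Unnormalized n = 0 solution psi_0(y) of (9.3.22)."""
    b = m * omega / HBAR
    shift = HBAR * A / (m * omega)
    return math.exp(-b * (y + shift) ** 2 / 2)


def residual(y, A, C, m, omega, h=1e-4):
    """Left side minus right side of (9.3.22) for the n = 0 state."""
    def psi(s):
        return ground_state(s, A, m, omega)

    second = (psi(y + h) - 2 * psi(y) + psi(y - h)) / h ** 2
    lhs = -HBAR ** 2 / (2 * m) * second + potential_9_3_22(y, A, m, omega) * psi(y)
    rhs = (landau_energy(0, C, m, omega) - HBAR ** 2 * (A ** 2 + C ** 2) / (2 * m)) * psi(y)
    return lhs - rhs


A, C, m, omega = 0.7, 1.1, 1.3, 0.9    # illustrative values only
ys = [-1.0, 0.2, 1.5]
square_gap = max(abs(potential_9_3_22(y, A, m, omega) - completed_square(y, A, m, omega)) for y in ys)
max_residual = max(abs(residual(y, A, C, m, omega)) for y in ys)
print(square_gap, max_residual)
```

Both printed numbers should be close to zero. Note that `landau_energy` takes no argument `A`, which mirrors the infinite degeneracy remarked on after (9.3.31).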

We have considered the effect of a magnetic field on energy levels in this chap- 
ter. Another interesting study is the effect of a magnetic field on the Selberg trace 
formula. This is considered by the author in one of the papers in [16]. We take up 
applications of the trace formula in Chapters 16 and 17. 



Appendix 9A Magnetic Dipole Moments 

To have a better understanding of Pauli's magnetic Hamiltonian $H_B$ (for example,
how one arrives at definition (9.2.13)) we consider first how the ratio

$$\frac{e}{2m_e c} = \frac{\mu}{\hbar}$$

in (9.2.15) arises in a particular context, in regard to the notion of a magnetic
dipole moment; cf. equation (9.A.3) below. First we replace the electron, of charge
$-e$ and mass $m_e$, by an arbitrary charged particle.

Figure 9.3: Right-hand rule

Thus, suppose we have a particle with charge $q$ and mass $m$. If the particle
moves in a circular orbit of radius $r$ and with a constant speed $v$, then as its
displacement $\Delta s$ along the circle over a period of time $\Delta t$ is given by
$\Delta s = r\,\Delta\theta$, $\Delta\theta$ being its angular displacement over the time period
$\Delta t$ (compare Figure 1.4), one has $\frac{\Delta s}{\Delta t} = r\frac{\Delta\theta}{\Delta t}$, or
$v = \frac{ds}{dt} = r\frac{d\theta}{dt} = r\omega$, where $\omega$ is the (constant) angular speed of
the particle. As before let $T$ denote the period of the motion: given any point on
the circle which the particle passes through at some instant, $T$ is the amount of
time required for the next passage through that point. Having noted in Section 1.4
of Chapter 1 that $T = 2\pi/\omega$ we can now use $v = r\omega$ to also write $T = 2\pi r/v$. The
current $i$ due to a charge is by definition the amount of charge passing through a
given point per unit time. Thus in our case $i = q/T = qv/(2\pi r)$. In general if an
amount of current $I$ is flowing through a circuit which encloses an area $A$, then
a magnetic dipole moment $\vec{\mu}$ is generated. Namely $\vec{\mu}$ is defined as a vector with
magnitude $IA/c$, in the c.g.s. system, where $c$ is the speed of light, with direction
determined by the right-hand rule: if the curled fingers represent the direction of
the current flow then the direction of $\vec{\mu}$ is that of the thumb, as illustrated in
Figure 9.3, with $\vec{\mu}$ normal to the plane area $A$. In the case at hand $A = \pi r^2$, which
means that for $|\vec{\mu}|$ the length of $\vec{\mu}$,

$$\vec{\mu} = \frac{iA}{c}\,\frac{\vec{\mu}}{|\vec{\mu}|} = \frac{qv\pi r^2}{2\pi r c}\,\frac{\vec{\mu}}{|\vec{\mu}|} = \frac{qvr}{2c}\,\frac{\vec{\mu}}{|\vec{\mu}|}. \tag{9.A.1}$$

On the other hand, suppose for example the circle is in the $xy$-plane and is
centered at the origin, as in Figure 9.4. As discussed in Section 5.4 of Chapter 5,
the angular momentum $\vec{M}$ of the particle is given by $\vec{M} \overset{\text{def}}{=} \vec{r} \times m\vec{v}$. The angle
between $\vec{r}$, $\vec{v}$ is $\pi/2$ ($\vec{r} \cdot \vec{v} = 0$). The length $|\vec{M}|$ of $\vec{M}$ is therefore
$|\vec{r}|\,m\,|\vec{v}| = mrv$. Also the direction of $\vec{M}$ is that of the positive $z$-axis, and thus
is that of $\vec{\mu}$: $\vec{M} = mrv\,\vec{\mu}/|\vec{\mu}|$. Comparing with (9.A.1), we obtain

$$\vec{\mu} = \frac{q}{2mc}\,\vec{M}. \tag{9.A.2}$$

The ratio $|\vec{\mu}|/|\vec{M}|$ of the length of magnetic moment to angular momentum is called a
gyromagnetic ratio.

Figure 9.4: Magnetic dipole moment (orbit in the $xy$-plane about the $z$-axis; $\vec{v}$ = velocity vector, $\vec{r}$ = radius vector)



In particular, for an electron in a circular orbit, moving classically with a constant
speed, as in Figure 9.4, we see that the magnetic dipole moment $\vec{\mu}$ it generates
(as a circulating charge about a circumference) is given by

$$\vec{\mu} = \frac{-e}{2m_e c}\,\vec{M} = -\frac{\mu}{\hbar}\,\vec{M} \quad \text{for } \mu \text{ in (9.2.15)}, \tag{9.A.3}$$

where $\vec{M}$ is its classical angular momentum. Thus we see the appearance of the
ratio $\frac{e}{2m_e c}$ in (9.2.15).

Although we derived (9.A.3) by assuming a circular orbit for the electron, the
same result would follow for an elliptic orbit or an orbit of any shape. Thus to an
electron with classical orbital angular momentum $\vec{M}$ there is associated a magnetic
dipole moment $\vec{\mu}_M = \vec{\mu}$ given by equation (9.A.3), which we can write as

$$\vec{\mu}_M = g_M\,\frac{\mu}{\hbar}\,\vec{M} \quad \text{for } g_M = -1. \tag{9.A.4}$$

There is, similarly, a magnetic moment $\vec{\mu}_S$ associated to an electron with intrinsic
spin angular momentum $\vec{S}$:

$$\vec{\mu}_S = g_S\,\frac{\mu}{\hbar}\,\vec{S} \tag{9.A.5}$$

for a suitable factor $g_S$, called the Landé (spin) $g$-factor. Recall that $\frac{\sqrt{3}}{2}\hbar$ is the
magnitude $|\vec{S}|$ of $\vec{S}$. Thus

$$|\vec{\mu}_S| = |g_S|\,\frac{\sqrt{3}}{2}\,\mu. \tag{9.A.6}$$

Now the value $|\vec{\mu}_S|$ can be determined experimentally, and since $\mu$ is known (again
see (0.0.15)) one finds that $g_S$ is given by

$$g_S = -2.002319304. \tag{9.A.7}$$

That is, $g_S \neq g_M$ but is about twice the value of $g_M$, a result which is known to
follow somewhat naturally from Dirac's relativistic quantum theory. For practical
purposes we will therefore express (9.A.5) as

$$\vec{\mu}_S = -\frac{2\mu}{\hbar}\,\vec{S} = -\frac{e}{m_e c}\,\vec{S}, \tag{9.A.8}$$

which is in accord in fact with the result of the 1915 experiment of Einstein and
de Haas.

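Now we sketch how one arrives at definition (9.2.13). In Appendix 8B we
indicated how an electric dipole with moment $\vec{m}$ placed in an electric field $\vec{E}$
acquires a potential energy given by $-\vec{m} \cdot \vec{E}$. Similarly an electron placed in a
magnetic field $\vec{B} = (B_1, B_2, B_3)$ acquires an additional potential energy given by
$-\vec{\mu}_S \cdot \vec{B}$. By (9.A.8) and (9.1.6)

$$\begin{aligned}
-\vec{\mu}_S \cdot \vec{B} &= \frac{e}{m_e c}\,\vec{S} \cdot \vec{B} \\
&= \frac{e}{m_e c}\,(B_1 S_x + B_2 S_y + B_3 S_z) \\
&= \frac{e\hbar}{2m_e c}\,(B_1\sigma_1 + B_2\sigma_2 + B_3\sigma_3) = (\mu\vec{\sigma}) \cdot \vec{B},
\end{aligned} \tag{9.A.9}$$

which is exactly the Pauli definition (9.2.13) (or (9.2.14)).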



Appendix 9B Quantization of the Magnetic Dipole Moment $\vec{\mu}$

Equation (9.A.3) of the preceding appendix provides the key to the question of
how to assign to the observable $\vec{\mu}$ a corresponding operator. Namely, as we know
how to quantize angular momentum $\vec{M}$ by Section 5.4 of Chapter 5, it is clear that
we should make the following assignment:

$$\vec{\mu} \longrightarrow \frac{-e}{2m_e c}\,\vec{M}, \tag{9.B.1}$$

where $\vec{M} = (M_x, M_y, M_z)$ is given by (5.4.4). That is,

$$\begin{aligned}
\mu_x &\overset{\text{def}}{=} \frac{-e}{2m_e c}\,\frac{\hbar}{i}\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right), \\
\mu_y &\overset{\text{def}}{=} \frac{-e}{2m_e c}\,\frac{\hbar}{i}\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right), \\
\mu_z &\overset{\text{def}}{=} \frac{-e}{2m_e c}\,\frac{\hbar}{i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right).
\end{aligned} \tag{9.B.2}$$

Appendix 9C Construction of Higher Spin Operators

The spin operators $S_x$, $S_y$, $S_z$ constructed in Section 9.1 act on a two-dimensional
vector space. The operator $S_x^2 + S_y^2 + S_z^2$ was shown to have a single eigenvalue
$s(s+1)\hbar^2$ where $s = 1/2$. Mathematically, this $s$-value represents the electron
spin. There are other particles of course (protons, neutrons, quarks, neutrinos,
for example) which also have spin $1/2$, as has been pointed out. There also exist
in nature particles with higher spin. For example, photons have spin $s = 1$, gravitons
have spin 2, "delta" particles have spin $3/2$. Another particle, the so-called
/?-meson, is known to have spin $s = 4$. Theoretically, particles can have arbitrarily
high spin. Mathematically, one can construct spin operators $S_x$, $S_y$, $S_z$ on an
$n$-dimensional vector space where $n \geq 2$ is arbitrary. This we do in the present
appendix. It should be noted however that at the present time particles with very
high spin remain to be found. The results here therefore are of mathematical
interest more than of physical interest.

To construct higher spin operators we apply the representation theory of the Lie
algebra $\mathfrak{g} = sl(2, \mathbb{C})$ of $2 \times 2$ complex matrices with trace zero. A representation
$\pi$ of $\mathfrak{g}$ on some complex vector space $V$ is simply a Lie algebra homomorphism
$\pi : \mathfrak{g} \to gl(V) \overset{\text{def}}{=}$ the Lie algebra of linear operators on $V$; i.e., $\pi$ is a linear map
such that

$$\pi([x, y]) = [\pi(x), \pi(y)] \quad \text{for } x, y \in \mathfrak{g} \tag{9.C.1}$$

where $[x, y] = xy - yx$ ($xy$ = matrix multiplication) and $[T, S] = TS - ST$ for
two linear operators $T$, $S$ on $V$. $\pi$ is irreducible if $\{0\}$ and $V$ are the only $\pi$-invariant
subspaces of $V$; here we assume $V \neq \{0\}$. $\mathfrak{g}$ has the standard $\mathbb{C}$-basis $\{H, E, F\}$
where



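$$H = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}, \quad E = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \quad F = \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix}, \tag{9.C.2}$$

$$[H, E] = 2E, \quad [H, F] = -2F, \quad [E, F] = H.$$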

The following result is standard [51]. 

Theorem 9.1. Let $m \geq 0$ be an integer, let $V$ be a complex vector space of
dimension $m + 1$, and let $\{x_0, x_1, \ldots, x_m\}$ be a $\mathbb{C}$-basis of $V$. Then there exists an
irreducible representation $\pi = \pi_m : \mathfrak{g} \to gl(V)$ of $\mathfrak{g} = sl(2, \mathbb{C})$ on $V$ such that

$$\pi(H)x_i = (m - 2i)x_i, \quad \pi(E)x_i = (m - i + 1)x_{i-1}, \quad \pi(F)x_i = (i + 1)x_{i+1} \quad \text{for } 0 \leq i \leq m \tag{9.C.3}$$

where $H$, $E$, $F$ are given in (9.C.2) and where we set $x_{-1} = 0 = x_{m+1}$. Conversely
any $(m+1)$-dimensional irreducible representation of $\mathfrak{g}$ on a complex vector space
is equivalent to $\pi_m$.

Now let $V$ be any complex vector space of dimension $n \geq 2$. We wish to
construct linear operators (higher spin operators) $S_x$, $S_y$, $S_z$ on $V$ subject to
the following conditions (as in (9.1.2)):

(i) $[S_x, S_y] = i\hbar S_z$, $[S_y, S_z] = i\hbar S_x$, $[S_z, S_x] = i\hbar S_y$,

(ii) the operator $S^2 \overset{\text{def}}{=} S_x^2 + S_y^2 + S_z^2 = s(s+1)\hbar^2 1_V$, where
the spin $s$ is given by $s = \frac{n-1}{2}$; $1_V$ is the identity operator on $V$,

(iii) the eigenvalues of $S_z$ are $\left\{\frac{\hbar}{2}(n - 1 - 2j)\right\}_{j=0}^{n-1}$. (9.C.4)

Thus for $n = 2$, $s = 1/2$ and the eigenvalues of $S_z$ are $\pm\hbar/2$, as for the case of
the $S_x$, $S_y$, $S_z$ of Section 9.1. To begin the construction apply Theorem 9.1 to $V$
where $m$ is chosen to be $n - 1$: $m + 1 = n$: there exists an irreducible representation
$\pi$ of $\mathfrak{g}$ on $V$ given by (9.C.3) for a choice of basis $\{x_0, x_1, \ldots, x_m = x_{n-1}\}$ of $V$:



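$$\begin{array}{lll}
\pi(H)x_j = (n - 1 - 2j)x_j, & 0 \leq j \leq n - 1, & \pi(E)x_0 = 0, \\
\pi(E)x_j = (n - j)x_{j-1}, & 1 \leq j \leq n - 1, & \pi(F)x_{n-1} = 0, \\
\pi(F)x_j = (j + 1)x_{j+1}, & 0 \leq j \leq n - 2. &
\end{array} \tag{9.C.5}$$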

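Using $\pi$ we can define linear operators $S_z$, $S_\pm$ on $V$ by

$$S_z \overset{\text{def}}{=} \frac{\hbar}{2}\pi(H), \quad S_+ \overset{\text{def}}{=} \hbar\pi(E), \quad S_- \overset{\text{def}}{=} \hbar\pi(F). \tag{9.C.6}$$

By (9.C.1) and (9.C.2) $\pi$ satisfies

$$[\pi(H), \pi(E)] = 2\pi(E), \quad [\pi(H), \pi(F)] = -2\pi(F), \quad \text{and} \quad [\pi(E), \pi(F)] = \pi(H), \tag{9.C.7}$$

which means that the operators $S_z$, $S_\pm$ satisfy the commutation relations

$$[S_z, S_\pm] = \pm\hbar S_\pm, \quad [S_+, S_-] = 2\hbar S_z. \tag{9.C.8}$$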



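Having defined $S_z$ by (9.C.6) we now define $S_x$, $S_y$ by

$$S_\pm = S_x \pm iS_y; \quad \text{i.e.,} \quad S_x \overset{\text{def}}{=} \frac{S_+ + S_-}{2}, \quad S_y \overset{\text{def}}{=} \frac{S_+ - S_-}{2i} \quad \text{for } i = \sqrt{-1}. \tag{9.C.9}$$

Then by (9.C.8) the operators $S_x$, $S_y$, $S_z$ indeed do satisfy the commutation relations
(i) in (9.C.4). For example

$$[S_z, S_x] \overset{\text{def}}{=} \frac{[S_z, S_+ + S_-]}{2} = \frac{\hbar S_+ - \hbar S_-}{2} = i\hbar S_y. \tag{9.C.10}$$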



Also by (9.C.5) and (9.C.6) we see that $S_z$ is diagonalizable with eigenvalues $\frac{\hbar}{2} \times$
(the eigenvalues of $\pi(H)$) = the collection $\left\{\frac{\hbar}{2}(n - 1 - 2j)\right\}_{j=0}^{n-1}$, as advertised in (iii)
of (9.C.4). The final property to check is condition (ii) in (9.C.4). First note that
for $S^2 \overset{\text{def}}{=} S_x^2 + S_y^2 + S_z^2$ one has

$$[S^2, S_x] = [S^2, S_y] = [S^2, S_z] = 0 \tag{9.C.11}$$

as in (5.A.6) of Appendix 5A, by the same proof of (5.A.6) there, given the relations
(i) of (9.C.4). Equation (9.C.11) means that $S^2$ commutes with $S_x$, $S_y$, $S_z$,
and therefore with $S_+$, $S_-$, $S_z$ (by (9.C.9)), and therefore with $\pi(H)$, $\pi(E)$, $\pi(F)$
(by (9.C.6)), and therefore commutes with all $\pi(x)$, $x \in \mathfrak{g}$. But this means
that $S^2$ must be a scalar operator $\lambda 1_V$, $\lambda \in \mathbb{C}$, since $\pi$ is irreducible. Namely let
$\lambda$ be an eigenvalue of $S^2$ (one exists since $V$ is a finite-dimensional complex vector
space $\neq \{0\}$). Then since $S^2$ commutes with $\pi(x)$, $x \in \mathfrak{g}$, the eigenspace of $\lambda$ is
$\pi$-invariant and thus, by irreducibility of $\pi$, must coincide with $V$, as it is $\neq \{0\}$.
Thus indeed $S^2 = \lambda 1_V$. To prove (9.C.4) (ii) we must therefore compute $\lambda$ and
show in fact that $\lambda = \frac{\hbar^2}{4}(n^2 - 1)$, which would indeed give

$$S^2 = \hbar^2 s(s + 1) 1_V \quad \text{for } s = \frac{n - 1}{2}. \tag{9.C.12}$$

Since $S^2$ is a scalar operator we can find $\lambda$ by understanding the action of $S^2$ on
a single non-zero vector $v \in V$. Namely we choose $v = x_0$, a so-called highest-weight
vector for $\pi$, with highest weight $n - 1$; i.e., $\pi(H)x_0 = (n - 1)x_0$ and
$\pi(E)x_0 = 0$ (by (9.C.5)). We have by (9.C.5)



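$$\begin{aligned}
S_+ x_0 &= \hbar\pi(E)x_0 = 0, \\
S_+ x_1 &= \hbar\pi(E)x_1 = \hbar(n - 1)x_0, \\
S_- x_0 &= \hbar\pi(F)x_0 = \hbar x_1, \\
S_- x_1 &= \hbar\pi(F)x_1 = \begin{cases} 2\hbar x_2 & \text{for } n \geq 3 \\ 0 & \text{for } n = 2. \end{cases}
\end{aligned} \tag{9.C.13}$$

Then

$$S_x x_0 \overset{\text{def}}{=} \frac{(S_+ + S_-)x_0}{2} = \frac{\hbar}{2}x_1, \qquad
S_x x_1 \overset{\text{def}}{=} \frac{(S_+ + S_-)x_1}{2} = \begin{cases} \dfrac{\hbar(n - 1)x_0 + 2\hbar x_2}{2} & \text{for } n \geq 3 \\[2mm] \dfrac{\hbar}{2}x_0 & \text{for } n = 2. \end{cases} \tag{9.C.14}$$

Similarly

$$S_y x_0 = -\frac{\hbar}{2i}x_1, \qquad
S_y x_1 = \begin{cases} \dfrac{\hbar(n - 1)x_0 - 2\hbar x_2}{2i} & \text{for } n \geq 3 \\[2mm] \dfrac{\hbar}{2i}x_0 & \text{for } n = 2. \end{cases} \tag{9.C.15}$$

Equations (9.C.14) and (9.C.15) imply that

$$S_x^2 x_0 = \frac{\hbar}{2}S_x x_1 = \begin{cases} \dfrac{\hbar^2}{4}(n - 1)x_0 + \dfrac{\hbar^2}{2}x_2 & \text{for } n \geq 3 \\[2mm] \dfrac{\hbar^2}{4}x_0 & \text{for } n = 2, \end{cases} \tag{9.C.16}$$

$$S_y^2 x_0 = -\frac{\hbar}{2i}S_y x_1 = \begin{cases} \dfrac{\hbar^2}{4}(n - 1)x_0 - \dfrac{\hbar^2}{2}x_2 & \text{for } n \geq 3 \\[2mm] \dfrac{\hbar^2}{4}x_0 & \text{for } n = 2. \end{cases} \tag{9.C.17}$$

Finally since

$$S_z^2 x_0 = \frac{\hbar^2}{4}\pi(H)^2 x_0 = \frac{\hbar^2}{4}(n - 1)^2 x_0$$

one obtains from (9.C.16), (9.C.17) that

$$S^2 x_0 = \frac{\hbar^2}{4}(n - 1)x_0 + \frac{\hbar^2}{2}x_2 + \frac{\hbar^2}{4}(n - 1)x_0 - \frac{\hbar^2}{2}x_2 + \frac{\hbar^2}{4}(n - 1)^2 x_0 = \frac{\hbar^2(n^2 - 1)}{4}x_0, \tag{9.C.18}$$

which also holds for $n = 2$.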
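
The construction of this appendix can be checked numerically. The following Python sketch builds the matrices of $\pi(H)$, $\pi(E)$, $\pi(F)$ in the basis $x_0, \ldots, x_{n-1}$ from (9.C.5), forms $S_x$, $S_y$, $S_z$ as in (9.C.6) and (9.C.9), and tests conditions (i) and (ii) of (9.C.4). It is only an illustration under stated assumptions: the helper names (`spin_operators`, `satisfies_9C4`, and so on), the choice $\hbar = 1$, and the plain nested-list matrix arithmetic are not taken from the text.

```python
def zeros(n):
    return [[0j] * n for _ in range(n)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def combine(a, b, cb=1):
    """Return a + cb * b entrywise."""
    n = len(a)
    return [[a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]


def scale(c, a):
    return [[c * entry for entry in row] for row in a]


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)


def max_abs_diff(a, b):
    n = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(n) for j in range(n))


def spin_operators(n, hbar=1.0):
    """Matrices of S_x, S_y, S_z in the basis x_0, ..., x_{n-1} of (9.C.5).

    Column j of each matrix holds the coordinates of the image of x_j.
    """
    H, E, F = zeros(n), zeros(n), zeros(n)
    for j in range(n):
        H[j][j] = n - 1 - 2 * j            # pi(H) x_j = (n - 1 - 2j) x_j
        if j >= 1:
            E[j - 1][j] = n - j            # pi(E) x_j = (n - j) x_{j-1}
        if j <= n - 2:
            F[j + 1][j] = j + 1            # pi(F) x_j = (j + 1) x_{j+1}
    Sz = scale(hbar / 2, H)                # (9.C.6)
    Sp, Sm = scale(hbar, E), scale(hbar, F)
    Sx = scale(0.5, combine(Sp, Sm))       # (9.C.9)
    Sy = scale(-0.5j, combine(Sp, Sm, -1))  # 1/(2i) = -i/2
    return Sx, Sy, Sz


def satisfies_9C4(n, hbar=1.0, tol=1e-12):
    """Check the commutation relations (i) and the scalar identity (ii)."""
    Sx, Sy, Sz = spin_operators(n, hbar)
    s = (n - 1) / 2
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    S2 = combine(combine(matmul(Sx, Sx), matmul(Sy, Sy)), matmul(Sz, Sz))
    errors = [
        max_abs_diff(commutator(Sx, Sy), scale(1j * hbar, Sz)),
        max_abs_diff(commutator(Sy, Sz), scale(1j * hbar, Sx)),
        max_abs_diff(commutator(Sz, Sx), scale(1j * hbar, Sy)),
        max_abs_diff(S2, scale(hbar ** 2 * s * (s + 1), identity)),
    ]
    return all(err < tol for err in errors)


if __name__ == "__main__":
    print([satisfies_9C4(n) for n in range(2, 7)])
```

For $n \geq 3$ the matrices of $S_x$ and $S_y$ produced this way are not Hermitian, because the basis vectors $x_j$ of Theorem 9.1 are not normalized; the commutation relations and the value of $S^2$ do not depend on that choice.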




10 

Introduction to Multi-Electron Atoms 



Some of the basic themes, or foundational concepts and principles of quantum
mechanics (Planck's quantization, Schrödinger's equation, de Broglie's wavelength,
Heisenberg's uncertainty principle, etc.) were introduced in Chapter 1.
There is another basic principle, Pauli's exclusion principle, which was not introduced
there but which will be considered in this chapter. This principle was
actually postulated prior to the main developments of post-Bohr quantum mechanics,
and has had a spectacular impact on the understanding of why
atoms exhibit different chemical properties. The principle resolves, among others,
the question of what determines the arrangements of electrons about a
nucleus, a question which we touch on only briefly.

As our discussions up to this point have largely been confined to single electron 
atoms it is appropriate to now consider multi-electron atoms. We shall invade this 
vast and beautiful domain of study in only a minor and introductory way. Since 
no technology exists to solve Schrodinger’s equation in a closed form when two 
or more electrons are present in an atomic system, one must resort to suitable 
approximation schemes. We introduce first-order time-independent perturbation 
theory and apply it to approximate the ground state energy of helium, which has 
two electrons — the simplest atom after hydrogen. We move on to some general 
remarks concerning multi-electron atoms, including some on electron configura- 
tions and the principle of identity of particles. 

10.1 First-Order Correction to $E_0$

A discussion of first-order time-dependent perturbation theory was presented in
Section 7.7 of Chapter 7. Here we consider an example of first-order time-independent
perturbation theory. Our primary interest is an application of the latter
theory to the computation of the ground state energy of the helium atom (or of
helium-like atoms, more generally).

We begin with the problem of approximating $E$ in the Schrödinger equation

$$H\psi = E\psi, \quad E \in \mathbb{R}, \tag{10.1.1}$$

where $H = H_0 + H_1$ is the sum of two operators $H_0$, $H_1$ (a situation similar to that
encountered in equation (7.7.5)), where we assume that the equation

$$H_0\psi_0 = E_0\psi_0, \quad E_0 \in \mathbb{R}, \quad \psi_0 \neq 0, \tag{10.1.2}$$

has already been solved: $\psi_0$, $E_0$ are known. In practice the perturbative effects of
the operator $H_1$ will be considered as minimal. Write

$$\psi = \psi_0 + \Delta\psi, \qquad E = E_0 + \Delta E, \tag{10.1.3}$$

where, in some sense, the unknowns $\Delta\psi$, $\Delta E$ are first-order corrections to $\psi_0$,
$E_0$, the idea being (very roughly) that if $H_1$ is sufficiently "small," then $H \approx H_0$,
in which case we think of equation (10.1.1) as $H_0\psi \approx E\psi$, and conclude by
equation (10.1.2) that we should take $\psi \approx \psi_0$, provided $E \approx E_0$.

Proceeding formally, we in fact define the first-order correction $\Delta E$ to $E_0$ by

$$\Delta E \overset{\text{def}}{=} \frac{(H_1\psi_0, \psi_0)}{(\psi_0, \psi_0)}. \tag{10.1.4}$$
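
To see definition (10.1.4) at work in the simplest possible setting, the following Python sketch compares $E_0 + \Delta E$ with the exact lowest eigenvalue of a real symmetric $2 \times 2$ matrix $H_0 + \epsilon H_1$. The finite-dimensional toy model, the particular matrices, and the function names are assumptions chosen for illustration; they do not come from the text.

```python
import math


def lowest_eigenvalue(a, b, d):
    """Smallest eigenvalue of the real symmetric matrix [[a, b], [b, d]]."""
    return (a + d) / 2.0 - math.hypot((a - d) / 2.0, b)


def first_order_estimate(eps):
    """E_0 + Delta E for H_0 = diag(1, 3) and H_1 = eps * [[0.5, 1.0], [1.0, -0.2]].

    The unperturbed ground state is psi_0 = (1, 0) with E_0 = 1, so
    (H_1 psi_0, psi_0) / (psi_0, psi_0) is the top-left entry of H_1.
    """
    E0 = 1.0
    delta_E = eps * 0.5
    return E0 + delta_E


def exact_ground_energy(eps):
    """Lowest eigenvalue of H_0 + H_1 for the same matrices."""
    return lowest_eigenvalue(1.0 + 0.5 * eps, 1.0 * eps, 3.0 - 0.2 * eps)


if __name__ == "__main__":
    for eps in (0.1, 0.01, 0.001):
        error = abs(exact_ground_energy(eps) - first_order_estimate(eps))
        print(eps, error)   # the error shrinks roughly like eps**2
```

In this toy model the discrepancy between the exact and first-order energies decreases about a hundredfold when $\epsilon$ decreases tenfold, which is the sense in which the neglected terms are "second order."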



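The motivation for this definition is given in the following sketch. By (10.1.1),
(10.1.2), (10.1.3), for $\psi_1 = \Delta\psi$, $E_1 = \Delta E$,

$$\begin{aligned}
H_0\psi_0 + H_1\psi_0 + H_0\psi_1 + H_1\psi_1 &= (H_0 + H_1)(\psi_0 + \psi_1) \\
&= H\psi = E\psi = (E_0 + E_1)(\psi_0 + \psi_1) \\
&= E_0\psi_0 + E_1\psi_0 + E_0\psi_1 + E_1\psi_1
\end{aligned} \tag{10.1.5}$$

$$\Longrightarrow \quad H_1\psi_0 + H_0\psi_1 = E_1\psi_0 + E_0\psi_1 + (E_1\psi_1 - H_1\psi_1) \quad (\text{as } H_0\psi_0 = E_0\psi_0)$$

$$\Longrightarrow \quad H_1\psi_0 + H_0\psi_1 \approx E_1\psi_0 + E_0\psi_1 \tag{10.1.6}$$

if we neglect the terms $E_1\psi_1$, $H_1\psi_1$ which are to be considered "small," or second
order. Now take the inner product of both sides of (10.1.6) with $\psi_0$, and bear in
mind that generically $H_0$ is a Hermitian operator and $E_0$ is real:

$$\begin{aligned}
& E_0(\psi_1, \psi_0) = (\psi_1, E_0\psi_0) = (\psi_1, H_0\psi_0) = (H_0\psi_1, \psi_0) \\
\Longrightarrow \quad & (H_1\psi_0, \psi_0) + E_0(\psi_1, \psi_0) = (H_1\psi_0, \psi_0) + (H_0\psi_1, \psi_0) \\
& \qquad = (H_1\psi_0 + H_0\psi_1, \psi_0) \approx (E_1\psi_0 + E_0\psi_1, \psi_0) \quad (\text{by (10.1.6)}) \\
& \qquad = E_1(\psi_0, \psi_0) + E_0(\psi_1, \psi_0) \\
\Longrightarrow \quad & (H_1\psi_0, \psi_0) \approx E_1(\psi_0, \psi_0) \\
\Longrightarrow \quad & E_1 = \Delta E \approx \frac{(H_1\psi_0, \psi_0)}{(\psi_0, \psi_0)},
\end{aligned} \tag{10.1.7}$$

which we take therefore as a reason for definition (10.1.4).

Example

A quantum system, an anharmonic oscillator, has potential $V$ given by

$$V(x) = Ax^2 + Bx^3 + Cx^4 \quad \text{for } A, B, C \in \mathbb{R},\ A > 0. \tag{10.1.8}$$

The quantized Hamiltonian $H$ in the Schrödinger equation (10.1.1) in this case
is given by (3.1.2):

$$\begin{aligned}
H &= -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V(x) = H_0 + H_1 \quad \text{for} \\
H_0 &\overset{\text{def}}{=} -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + Ax^2, \\
H_1 &\overset{\text{def}}{=} \text{multiplication by } Bx^3 + Cx^4.
\end{aligned} \tag{10.1.9}$$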



Define $\omega > 0$ by $A = \omega^2 m/2$ for $m > 0$ fixed. Thus the function $V_0(x) = Ax^2$
corresponds to the potential energy of a harmonic oscillator with mass $m$ and
frequency $\nu = \omega/(2\pi)$. Using first-order time-independent perturbation theory
we approximate the ground state energy of the anharmonic oscillator. A normalized
solution $\psi_0$ of (10.1.2) with corresponding energy-eigenvalue $E_0$ is given by
equation (4.8.5) and by (4.8.1):

$$\psi_0(x) = \left(\frac{b}{\pi}\right)^{1/4} e^{-bx^2/2}, \qquad E_0 = \pi\nu\hbar = \frac{\hbar\omega}{2} \tag{10.1.10}$$

for $b = 2\pi\nu m/\hbar = \omega m/\hbar$. The first-order correction $\Delta E$ to the ground state
energy $E_0$ of the harmonic oscillator is given by the key definition (10.1.4):

$$\Delta E \overset{\text{def}}{=} (H_1\psi_0, \psi_0) = \left(\frac{b}{\pi}\right)^{1/2} \int_{\mathbb{R}} (Bx^3 + Cx^4)\, e^{-bx^2}\, dx \tag{10.1.11}$$

where

$$\int_{\mathbb{R}} Bx^3 e^{-bx^2}\, dx = 0 \tag{10.1.12}$$

as the integrand is an odd function of $x$. Now by equation (4.7.59) (or
by equation (4.7.60))

$$\int_{\mathbb{R}} x^4 e^{-bx^2}\, dx = 2\int_0^\infty x^4 e^{-bx^2}\, dx = \frac{3}{4b^2}\sqrt{\frac{\pi}{b}}. \tag{10.1.13}$$

Since $b^2 = \omega^2 m^2/\hbar^2 = 2Am^2/(m\hbar^2) = 2Am/\hbar^2$ we see that $\Delta E$ is independent
of $B$ and is given by

$$\Delta E = \frac{3C}{4b^2} = \frac{3C\hbar^2}{8Am}. \tag{10.1.14}$$

The ground state energy of the anharmonic oscillator is, by (10.1.3) and (10.1.14),

$$E = E_0 + \Delta E = \frac{\hbar\omega}{2} + \frac{3C\hbar^2}{8Am} \tag{10.1.15}$$

up to first-order approximation. From (10.1.15) it is intuitively clear that the
"size" of the perturbative effect of $H_1$, in the present case, is controlled by the
size of $C$ (since $H_1$ is independent of $A$, which is regarded as fixed).
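
The value of $\Delta E$ in the example can be confirmed numerically. The following Python sketch evaluates the integral in (10.1.11) by Simpson's rule and compares it with the closed form $3C\hbar^2/(8Am)$. The function names, the truncation of the real line to a finite interval, the step count, and the sample parameter values are assumptions made for this illustration.

```python
import math


def first_order_shift_numeric(B, C, b, half_width=12.0, steps=4000):
    """Simpson's rule for sqrt(b/pi) * integral of (B x^3 + C x^4) exp(-b x^2)."""
    if steps % 2:
        raise ValueError("steps must be even for Simpson's rule")
    L = half_width / math.sqrt(b)        # the Gaussian is negligible beyond +-L
    h = 2.0 * L / steps

    def integrand(x):
        return (B * x ** 3 + C * x ** 4) * math.exp(-b * x * x)

    total = integrand(-L) + integrand(L)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(-L + k * h)
    return math.sqrt(b / math.pi) * total * h / 3.0


def first_order_shift_closed_form(A, C, m, hbar=1.0):
    """Delta E = 3 C hbar^2 / (8 A m), using b^2 = 2 A m / hbar^2."""
    return 3.0 * C * hbar ** 2 / (8.0 * A * m)


if __name__ == "__main__":
    A, B, C, m, hbar = 0.5, 0.3, 0.05, 1.0, 1.0
    b = math.sqrt(2.0 * A * m) / hbar
    print(first_order_shift_numeric(B, C, b))
    print(first_order_shift_closed_form(A, C, m, hbar))
```

Changing `B` leaves the numerical result unchanged up to rounding, in line with the observation that the cubic term contributes nothing at first order.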



10.2 Helium-Like Atoms: Their 
Ground-State Energies 

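From equations (3.5.21), (3.5.22) the time-independent Schrödinger equation
for the helium atom is given by

$$\begin{gathered}
-\frac{\hbar^2}{2m_0}\nabla_0^2\psi(x) - \frac{\hbar^2}{2m_e}\left(\nabla_1^2 + \nabla_2^2\right)\psi(x) + V(x)\psi(x) = E\psi(x) \\
\text{for } V(x) = -\sum_{i=1}^{2}\frac{Ze^2}{|x^{(i)} - x^{(0)}|} + \frac{e^2}{|x^{(1)} - x^{(2)}|},
\end{gathered} \tag{10.2.1}$$

for $m_0$ = the nuclear mass, $m_e$ = the electron mass, $Z = 2$, $x^{(0)} = (x_0, y_0, z_0)$ =
the nuclear position, $x^{(i)} = (x_i, y_i, z_i)$ = the position of the $i$th electron, $i = 1, 2$,
$x = (x^{(0)}, x^{(1)}, x^{(2)})$, and for

$$\nabla_j^2 = \frac{\partial^2}{\partial x_j^2} + \frac{\partial^2}{\partial y_j^2} + \frac{\partial^2}{\partial z_j^2} \quad (j = 0, 1, 2) \tag{10.2.2}$$

as usual. Instead of fixing the atomic number $Z = 2$ we shall let $Z$ vary, at
no extra cost, in order to incorporate, more generally, helium-like atoms in the
discussion. It is precisely because of the inter-electronic repulsion term

$$\frac{e^2}{|x^{(1)} - x^{(2)}|} \tag{10.2.3}$$

in (10.2.1) that an exact solution of (10.2.1) has never been found. It is known
that no appreciable error is introduced if, as a first approximation, we consider the
nucleus at rest, say at the origin $x^{(0)} = (0, 0, 0)$, given that $m_0 \gg m_e$. That is, we
replace equation (10.2.1) by the equation

$$\begin{gathered}
-\frac{\hbar^2}{2m_e}\left(\nabla_1^2 + \nabla_2^2\right)\psi(x^{(1)}, x^{(2)}) + V(x^{(1)}, x^{(2)})\,\psi(x^{(1)}, x^{(2)}) = E\psi(x^{(1)}, x^{(2)}) \\
\text{for } V(x^{(1)}, x^{(2)}) \overset{\text{def}}{=} -\sum_{i=1}^{2}\frac{Ze^2}{|x^{(i)}|} + \frac{e^2}{|x^{(1)} - x^{(2)}|},
\end{gathered} \tag{10.2.4}$$

which still has never been solved exactly because the term (10.2.3) still appears.

Towards an approximate solution of (10.2.4), or at least an approximation of
the ground energy $E = E_0$, we first ignore the term in (10.2.3). Then equation
(10.2.4) reduces to

$$-\frac{\hbar^2}{2m_e}\left(\nabla_1^2 + \nabla_2^2\right)\psi(x^{(1)}, x^{(2)}) - \left(\frac{Ze^2}{|x^{(1)}|} + \frac{Ze^2}{|x^{(2)}|}\right)\psi(x^{(1)}, x^{(2)}) = E\psi(x^{(1)}, x^{(2)}) \tag{10.2.5}$$

where $x^{(1)} \neq (0, 0, 0)$, $x^{(2)} \neq (0, 0, 0)$. Now equation (10.2.5) certainly has exact
solutions. Namely, suppose we have solutions $\psi^{(i)}$ of the equations

$$-\frac{\hbar^2}{2m_e}\nabla_i^2\psi^{(i)}(x^{(i)}) - \frac{Ze^2}{|x^{(i)}|}\psi^{(i)}(x^{(i)}) = E_i\psi^{(i)}(x^{(i)}), \quad E_i \in \mathbb{R},\ i = 1, 2, \tag{10.2.6}$$

on $\mathbb{R}^3 - \{(0, 0, 0)\}$. Set $\psi = \psi^{(1)} \otimes \psi^{(2)}$; i.e.,

$$\psi(x_1, y_1, z_1, x_2, y_2, z_2) = \psi^{(1)}(x_1, y_1, z_1)\,\psi^{(2)}(x_2, y_2, z_2) \tag{10.2.7}$$

for $x^{(i)} = (x_i, y_i, z_i) \neq 0$. Then by straightforward differentiation (similar to the
argument which led to the converse of Theorem 3.3) one sees that indeed $\psi$ is a
solution of (10.2.5) for $E = E_1 + E_2$ there. On the other hand, the equations (10.2.6)
coincide with the first equation in (5.1.9), where the reduced mass $\mu$ there corresponds
to $m_e$ here. Therefore for internal negative energies $E_1$, $E_2$ we have normalized
solutions $\psi^{(i)} = \psi_{n_i l_i m_i}$ of (10.2.6) given by (5.2.12) (or by (5.2.15)) for

$$E_i = -\frac{m_e Z^2 e^4}{2\hbar^2 n_i^2}, \tag{10.2.8}$$

as in definition (5.1.29). As usual $n$, $l$, $m$ are integers with $n \geq 1$, $0 \leq l \leq n - 1$,
$-l \leq m \leq l$. Thus we obtain by (10.2.7) solutions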



$$\psi = \psi^{(1)}_{n_1 l_1 m_1} \otimes \psi^{(2)}_{n_2 l_2 m_2} \tag{10.2.9}$$

of (10.2.5) for

$$E = E_1 + E_2 = -Z^2 E_h\left(\frac{1}{n_1^2} + \frac{1}{n_2^2}\right), \tag{10.2.10}$$

where

$$E_h \overset{\text{def}}{=} \frac{m_e e^4}{2\hbar^2}; \tag{10.2.11}$$

that is, $-E_h$ is the ground state energy of hydrogen, given by (5.8.2), where we
have replaced $\mu = .99\,m_e$ there by $m_e$; recall (0.0.10).

Consider in particular the lowest state $\psi_0 = \psi_{100} \otimes \psi_{100}$, as we are interested
in approximating the ground state energy of a helium-like atom. Here of course
we must compute the first-order correction term $\Delta E$ in (10.1.4), where the inner
product there is given by an integral quite a bit more complicated than that given
in (10.1.11). If

$$H \overset{\text{def}}{=} -\frac{\hbar^2}{2m_e}\left(\nabla_1^2 + \nabla_2^2\right) + V \tag{10.2.12}$$

is the Hamiltonian in (10.2.4), then clearly $H = H_0 + H_1$ for

$$H_0 \overset{\text{def}}{=} -\frac{\hbar^2}{2m_e}\left(\nabla_1^2 + \nabla_2^2\right) - \sum_{i=1}^{2}\frac{Ze^2}{|x^{(i)}|}, \qquad
H_1 \overset{\text{def}}{=} \text{multiplication by } \frac{e^2}{|x^{(1)} - x^{(2)}|}, \tag{10.2.13}$$

where

$$H_0\psi_0 = E_0\psi_0 \quad \text{for } E_0 = -2Z^2 E_h \tag{10.2.14}$$

as we have just noted in (10.2.9), (10.2.10). Of course (10.2.4) is now the equation

$$H\psi = E\psi, \tag{10.2.15}$$

as in (10.1.1), and we are therefore in position to apply the perturbation theory
of the preceding section. Note that the $\psi^{(1)} \otimes \psi^{(2)}$ in (10.2.9) (and in particular
the state $\psi_0$) are all normalized, since by construction the variables $(x_1, y_1, z_1)$,
$(x_2, y_2, z_2)$ are separated. To approximate $E$, to first order, the main problem
therefore is to compute the integral

$$(H_1\psi_0, \psi_0) = e^2 \int_{\mathbb{R}^6} \frac{|\psi_{100}(x_1, y_1, z_1)|^2\, |\psi_{100}(x_2, y_2, z_2)|^2}{\sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2}}\; dx_1\, dy_1\, dz_1\, dx_2\, dy_2\, dz_2, \tag{10.2.16}$$

which is the first-order correction to the energy $E_0$ in (10.1.4). Using spherical
coordinates, we can express $\Delta E$ as

$$\Delta E = e^2 \int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^\pi\!\!\int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^\pi \frac{|\psi_{100}(s(r_1, \theta_1, \phi_1))|^2\, |\psi_{100}(s(r_2, \theta_2, \phi_2))|^2}{r_{12}}\; r_1^2 \sin\phi_1\, d\phi_1\, d\theta_1\, dr_1\; r_2^2 \sin\phi_2\, d\phi_2\, d\theta_2\, dr_2, \tag{10.2.17}$$

employing earlier notation, where $r_{12}$ is the distance between the two electrons,
and where by (5.3.12)

$$\psi_{100}(s(r_j, \theta_j, \phi_j)) = \frac{1}{\sqrt{\pi}}\left(\frac{Z}{a_0}\right)^{3/2} e^{-Zr_j/a_0} \quad \text{for } j = 1, 2. \tag{10.2.18}$$

That is,

$$\Delta E = \frac{e^2 Z^6}{\pi^2 a_0^6} \int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^\pi\!\!\int_0^\infty\!\!\int_0^{2\pi}\!\!\int_0^\pi \exp\left[-\frac{2Z}{a_0}(r_1 + r_2)\right] \frac{r_1^2 \sin\phi_1\, d\phi_1\, d\theta_1\, dr_1\; r_2^2 \sin\phi_2\, d\phi_2\, d\theta_2\, dr_2}{r_{12}} \tag{10.2.19}$$

for

$$a_0 \overset{\text{def}}{=} \frac{\hbar^2}{m_e e^2}. \tag{10.2.20}$$

That is, in the definition of $a_0$ in (5.3.1) we replace $\mu$ there by $m_e$, in accord with
the remarks preceding (10.2.8). Although the iterated integral in (10.2.19) is not
quite so simple it can be evaluated; see Appendix 10A to this chapter, and the
General Appendix D following Chapter 18. The end result is that

$$\Delta E = \frac{5Ze^2}{8a_0} = \frac{5Zm_e e^4}{8\hbar^2} = \frac{5}{4}ZE_h \tag{10.2.21}$$

by (10.2.11), (10.2.20).

By (10.1.3), (10.2.10), (10.2.21), we obtain the main result of this section: the
ground state energy $E$ of a helium-like atom is given by

$$E = E_0 + \Delta E = -\left(2Z^2 - \frac{5}{4}Z\right)E_h = -(2Z^2 - 1.25Z)\,13.6\ \text{eV}, \tag{10.2.22}$$

up to a first-order approximation. Obviously it is of interest to compare the theoretical
result (10.2.22) with known experimental results compiled by John Slater.
This is done in Table 10.1; cf. [78, pp. 339-342].

Atom    Z    2Z^2 - 1.25Z    -E_t = (2Z^2 - 1.25Z) 13.6 eV    -E_e (eV)
He      2        5.5                   74.8                     79.02
Li+     3       14.25                 193.8                    198.02
Be2+    4       27.00                 367.2                    371.42
B3+     5       43.75                 595.0                    599.35
C4+     6       64.50                 877.2                    881.69
N5+     7       89.25                1213.8                   1218.56
O6+     8      118.00                1604.8                   1610.10

Table 10.1: Comparison of theoretical values $E_t$ obtained in (10.2.22) with experimental
values $E_e$ of helium-like atoms

Sometimes energy is expressed in terms of Rydberg units, where 1 Rydberg
= 13.6 eV. Thus for helium, for example, $-E_e = 79.02/13.6 = 5.81$ Rydbergs. Even
though the perturbation operator $H_1$ in (10.2.13) is not necessarily "small" the
relative errors $(E_t - E_e)/E_e$ in Table 10.1 are small. For $Z = 2, 3, 4, 5, 6, 7, 8$
they are $-.05340$, $-.02131$, $-.01136$, $-.00725$, $-.00509$, $-.00390$, $-.00329$,
respectively. Thus in absolute value the percentage errors range from about 5.3%
for He to about .33% for O$^{6+}$, which shows fairly good agreement between theory
and experiment.
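
The entries of Table 10.1 and the relative errors just quoted are easy to recompute. The following Python sketch does so from (10.2.22); the experimental values are copied from the table, while the function names and the constant name `RYDBERG_EV` are assumptions introduced only for this illustration.

```python
RYDBERG_EV = 13.6   # value of E_h in eV used in (10.2.22)

# (atom, Z, experimental -E_e in eV), as listed in Table 10.1
TABLE_10_1 = [
    ("He", 2, 79.02), ("Li+", 3, 198.02), ("Be2+", 4, 371.42),
    ("B3+", 5, 599.35), ("C4+", 6, 881.69), ("N5+", 7, 1218.56),
    ("O6+", 8, 1610.10),
]


def theoretical_energy(Z):
    """First-order ground state energy E_t of (10.2.22), in eV (negative)."""
    return -(2.0 * Z ** 2 - 1.25 * Z) * RYDBERG_EV


def relative_error(Z, minus_E_e):
    """(E_t - E_e) / E_e, where the table lists -E_e."""
    E_e = -minus_E_e
    return (theoretical_energy(Z) - E_e) / E_e


if __name__ == "__main__":
    for atom, Z, minus_E_e in TABLE_10_1:
        print(f"{atom:5s} Z={Z}  -E_t={-theoretical_energy(Z):8.1f} eV  "
              f"rel. error={relative_error(Z, minus_E_e):+.5f}")
```

The relative error decreases as $Z$ grows because the unperturbed energy scales like $Z^2$ while the first-order repulsion term scales only like $Z$.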

10.3 Pauli’s Exclusion Principle and Electron 
Configurations 

We have seen that because of inter-electron interactions the calculation in closed
form of energy levels and wave functions of atoms with $Z$ electrons, $Z \geq 2$, is
virtually a mathematical impossibility. In addition to the Coulomb forces of
electronic interaction one must also deal with certain magnetic spin-orbit forces
(Russell-Saunders L-S coupling).

Various approximation methods have been devised for multi-electron calculations.
A simplifying assumption (one which indeed is subject to further refinements)
is that each electron is considered to move independently under the
Coulomb influence of the nucleus and under the average Coulomb field of the
other $Z - 1$ electrons. In this central field approximation (introduced by John
Slater in 1929) all magnetic interactions are neglected. The total wave function
of the atomic system will be a product of $Z$ one-particle wave functions (as the
electrons move independently) corresponding to a spherically symmetric potential.
As in Chapter 5 one can assign to each electron a set of four quantum numbers
$(n, l, m_l, m_s)$ which describes its state, the state being the corresponding individual
wave function, which is also called an orbital. By calculations similar to those
of Chapter 5 one finds that the angular momentum quantum number (or azimuthal
quantum number) $l$ and the magnetic quantum number $m_l$ are the same as those
for the hydrogenic atom. The electron spin quantum number $m_s$ ($m_s = \frac{1}{2}$ or $-\frac{1}{2}$)
also is the same as for the hydrogenic atom. However the (generalized) principal
quantum number $n$ will be different from the $n$ in (5.2.6) which helped to label the
radial wave function there. This is because now the potential in the radial equation
is more general than the simple Coulomb potential. The energy levels will depend
on $n$ and $l$, and not just $n$ alone as in the hydrogenic case.

Concerning the quantum numbers $(n, l, m_l, m_s)$, Pauli in 1925 enunciated the
following striking principle, after analyzing an array of spectroscopic data for
multi-electron atoms:

    No two electrons in an atom can have an identical set of quantum numbers. (10.3.1)

This exclusion principle, which is to be taken as axiomatic in quantum mechanics,
has led to a basic understanding, for example, of the shell structure of atoms, and
to key facts concerning the chemical valence. Our discussion prior to its statement
was a bit sketchy since in fact we plan to present a more general formulation of
this fundamental principle.

Electrons with the same principal quantum number $n$ are said to form a shell.
With $n$ fixed one can consider electrons with the same $n$ and the same azimuthal
number $l$. This collection of electrons forms a subshell of the shell defined by $n$.
With $n$ and $l$ fixed there are $2l + 1$ possible values for $m_l$ ($-l \leq m_l \leq l$), and two
possible values for $m_s$. This means that the maximal number of electrons that can
occupy a subshell, without violating Pauli's exclusion principle, is $2(2l + 1)$; this
fact is known as Stoner's rule. For example for $n = 2$, $l = 1$, only the following
states are possible: $(2, 1, -1, 1/2)$, $(2, 1, -1, -1/2)$, $(2, 1, 0, 1/2)$, $(2, 1, 0, -1/2)$,
$(2, 1, 1, 1/2)$, $(2, 1, 1, -1/2)$. For $n = 2$ and $l = 0$ we must have $m_l = 0$. Then
only the states $(2, 0, 0, 1/2)$, $(2, 0, 0, -1/2)$ are possible. Since each $l$ gives rise to
a maximum number of $2(2l + 1)$ states, for $n$ fixed, there can be at most

$$\sum_{l=0}^{n-1} 2(2l + 1) = 2\left[2\sum_{l=0}^{n-1} l + \sum_{l=0}^{n-1} 1\right] = 2\left[2 \cdot \frac{(n - 1)n}{2} + n\right] = 2n^2 \tag{10.3.2}$$

electrons occupying a shell. This number is 8 in the preceding example.
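
The counting behind Stoner's rule and (10.3.2) can be reproduced by listing the admissible quadruples directly. The following Python sketch is a minimal illustration; the function names `subshell_states` and `shell_states` are assumptions, and exact fractions are used for $m_s$ simply to avoid floating-point labels.

```python
from fractions import Fraction


def subshell_states(n, l):
    """All quadruples (n, l, m_l, m_s) allowed in the subshell (n, l)."""
    half = Fraction(1, 2)
    return [(n, l, m_l, m_s)
            for m_l in range(-l, l + 1)      # -l <= m_l <= l
            for m_s in (half, -half)]        # two spin orientations


def shell_states(n):
    """All quadruples allowed in the shell n, i.e. for l = 0, ..., n - 1."""
    return [state for l in range(n) for state in subshell_states(n, l)]


if __name__ == "__main__":
    print(len(subshell_states(2, 1)))                    # 2(2l + 1) for l = 1
    print([len(shell_states(n)) for n in range(1, 5)])   # compare with 2 n^2
```

Since no quadruple is listed twice, the exclusion principle (10.3.1) allows at most one electron per listed state, which gives the capacities $2(2l + 1)$ and $2n^2$.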

It is customary to employ, alternatively, x-ray spectroscopic notation: One writes

    shell letters        K, L, M, N, O, P, ...,
    for the n numbers    1, 2, 3, 4, 5, 6, ...,
                                                                  (10.3.3)
    and subshell letters s, p, d, f, g, h, i, j, k, ...,
    for the l values     0, 1, 2, 3, 4, 5, 6, 7, 8, ...,

where the $s$ stands for "sharp," $p$ for "principal," $d$ for "diffuse," and $f$ stands
for "fundamental." A $d$ subshell for example (where $l = 2$) can accommodate
$2(2l + 1) = 10$ electrons.

There is a tendency for a physical system to acquire its lowest energy state.
Thus for the hydrogen atom, the lowest energy state, or ground state, is given by
$n = 1$, as we have seen (the corresponding energy being about $-13.59$ electron
volts). Chemists and others use the notation in (10.3.3) to describe the electron
configuration of atoms in their ground state. For nitrogen (N), with $Z = 7$, this
configuration is $1s^2\ 2s^2\ 2p^3$, which means the following: reading pairs $1s^2$, $2s^2$,
$2p^3$ from left to right with superscripts denoting the number of electrons with
quantum numbers $n$, $l$, one takes $1s^2$ to mean there are two electrons with quantum
numbers $n = 1$, $l = 0$ (since $s$ corresponds to $l = 0$, by (10.3.3)); similarly $2s^2$
means there are two electrons with quantum numbers $n = 2$, $l = 0$, and $2p^3$ means
there are three electrons with quantum numbers $n = 2$, $l = 1$ (since $p$ corresponds
to $l = 1$, by (10.3.3)). The $Z = 7$ $(= 2 + 2 + 3)$ electrons thus are all accounted for.

As a second example consider oxygen (O), with $Z = 8$. Its ground state electron
configuration is given by $1s^2\ 2s^2\ 2p^4$, which we translate to mean that there
are two electrons with quantum numbers $n, l = 1, 0$, there are two electrons with
quantum numbers $n, l = 2, 0$, and there are four electrons with quantum numbers
$n, l = 2, 1$. Here we see that the K shell ($n = 1$) is filled or closed since the
maximum number of electrons it can hold is $2n^2 = 2$. The L shell ($n = 2$) is not
closed since it contains six (but not eight) electrons. For a third example, we take
the sodium atom (Na) with $Z = 11$ electrons (and therefore 11 protons) and 12
neutrons (n). Its ground state electron configuration is $1s^2\ 2s^2\ 2p^6\ 3s^1$. One can
picture this atom by way of the shell diagram in Figure 10.1.




Figure 10.1: Shell diagram for Na

There are $2 + 6 = 8$ electrons in the L shell ($n = 2$), which is thus closed;
two of these electrons have quantum numbers $n, l = 2, 0$ and six of them have
quantum numbers $n, l = 2, 1$. There is one electron in the M shell with quantum
numbers $n, l = 3, 0$, and there are two electrons in the K shell with quantum
numbers $n, l = 1, 0$. The fact that the outer shell of sodium has a single electron
(that is, its valence number is 1), whereas the outer shell of the chlorine atom
lacks exactly one electron to close it (its valence number is $-1$) ultimately relates
to the ability of these two atoms to combine to form sodium chloride (NaCl),
ordinary table salt. The valence number is a measure of the number of electrons
that an atom is willing to give up, accept, or share during a chemical reaction.
For further details with regard to the example at hand we start with the electron
configuration $1s^2\ 2s^2\ 2p^6\ 3s^2\ 3p^5$ of chlorine. Clearly $Z = 2 + 2 + 6 + 2 + 5 = 17$.
One has the shell diagram depicted in Figure 10.2.

In the M shell there are two electrons with quantum numbers $n, l = 3, 0$ and five
electrons with quantum numbers $n, l = 3, 1$. Recalling that the subshell defined by
$n, l$ can accommodate up to $2(2l + 1)$ electrons we see that the 3, 1 subshell of the
chlorine atom (which can accommodate six electrons but only has five) is shy one
electron, as remarked above. The sodium atom will readily give up (or lend) its
one outer valence electron which the chlorine atom will readily accept to close its
outer subshell. In the process sodium acquires a positive charge (having given up
an electron), and chlorine acquires a negative charge (having gained an electron).
These oppositely charged particles attract each other to form sodium chloride.

Figure 10.2: Shell diagram for Cl

In the electron configuration, again as one reads left to right, one notes the
increase in electron energy (since $n$ increases). Table 10.2 gives some additional
electron configurations for a few light atoms.
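
For the light atoms discussed here, the ground state configuration can be generated mechanically by filling the subshells 1s, 2s, 2p, 3s, 3p in that order, each up to its capacity $2(2l + 1)$. The following Python sketch does this; the restriction to $Z \leq 18$, the fixed filling order, the output format with `^`, and the function name are assumptions made for this illustration, and the simple order used would not be adequate for heavier atoms.

```python
# (n, subshell letter, l) in the assumed filling order for light atoms
SUBSHELLS = [(1, "s", 0), (2, "s", 0), (2, "p", 1), (3, "s", 0), (3, "p", 1)]


def ground_state_configuration(Z):
    """Configuration string such as '1s^2 2s^2 2p^3' for 1 <= Z <= 18."""
    if not 1 <= Z <= 18:
        raise ValueError("this sketch only covers 1 <= Z <= 18")
    parts, remaining = [], Z
    for n, letter, l in SUBSHELLS:
        if remaining == 0:
            break
        capacity = 2 * (2 * l + 1)           # Stoner's rule
        occupied = min(capacity, remaining)
        parts.append(f"{n}{letter}^{occupied}")
        remaining -= occupied
    return " ".join(parts)


if __name__ == "__main__":
    for name, Z in (("N", 7), ("O", 8), ("Na", 11), ("Cl", 17)):
        print(name, ground_state_configuration(Z))
```

The four atoms in the usage example are the ones worked out in the text, so the printed strings can be compared directly with the configurations given there.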

The exclusion principle and the classification of electronic states by their quan- 
tum numbers (the latter being based on an approximation, as we have seen) pro- 
vide quite adequate information and means for explanation of the Mendeleev pe- 
riodic table (after Dimitri Ivanovitch Mendeleev (1834-1907)); see General Ap- 
pendix B following Chapter 18. Elements in the table, apart from a central list 
of transition and rare earth elements, are organized into eight groups, a group 
being a column of elements with quite similar chemical properties, because they 
have similar electron configurations. Elements are also organized into periods, 
which are rows where the atomic number Z increases by one as one reads from 
left to right. Electrons first fill the lower energy shells. When these are filled (in 
the lighter atoms hydrogen, helium, lithium, etc.), higher energy shells start to be 
filled. One notes the outermost shells being filled as one reads across a row (again 
from left to right). By this building-up principle the table (with heavier elements 
being listed) is generated. The theoretical construction of the Mendeleev table by
Bohr in 1922 (and also by Main Smith and Stoner) was at the time one of the great and
marvelous achievements of quantum theory.

Atom        Symbol   Electron configuration
Hydrogen    H        $1s^1$
Helium      He       $1s^2$
Lithium     Li       $1s^2\,2s^1$
Beryllium   Be       $1s^2\,2s^2$
Boron       B        $1s^2\,2s^2\,2p^1$
Carbon      C        $1s^2\,2s^2\,2p^2$
Fluorine    F        $1s^2\,2s^2\,2p^5$
Neon        Ne       $1s^2\,2s^2\,2p^6$

Table 10.2: Ground state configurations

At the quantum level, some particles (electrons, protons, neutrons, for example)
are utterly identical. They are indistinguishable in the sense that they cannot be
distinguished by any measurement. All electrons have the same mass, the same
spin and charge, the same magnetic moment, etc.; “if you have seen one electron
you have seen them all.” There is a principle of identity of particles from which
one can derive many fundamental and profound consequences. We consider it in
relation to a more general formulation of Pauli’s exclusion principle. As we shall
see, there is imposed on the wave functions of systems of identical particles either
a symmetry or an antisymmetry feature. A formal statement of the principle of identity
of particles, one suitable for our purposes, is the following:

The state of a system of identical particles cannot change when
two of them are interchanged. That is, if the coordinates and
spin of two particles of the system are interchanged the wave
function of the system can change only by some phase factor
$e^{i\theta}$, $\theta \in \mathbb{R}$. (10.3.4)

We proceed mathematically to gain a more practical understanding of this principle.
Suppose there is a system of $n$ identical particles; i.e., they all have the same
mass, the same charge, the same spin, etc. (assuming they have a charge or spin,
for example). Let $H$ denote the quantum Hamiltonian of the system. Then if $P_{jk}$
denotes the exchange or transposition operator defined by

$$(P_{jk}f)(q_1, \dots, q_j, \dots, q_k, \dots, q_n, t) \stackrel{\text{def}}{=} f(q_1, \dots, q_k, \dots, q_j, \dots, q_n, t), \quad j \neq k \tag{10.3.5}$$

(here one may think of $q_j$ as the position of the $j$th particle at time $t$), one requires
that

$$HP_{jk} = P_{jk}H \quad \text{for all } j, k,\ j \neq k. \tag{10.3.6}$$

Schrödinger’s equation for the system is of course

$$i\hbar\frac{\partial\psi}{\partial t} = H\psi. \tag{10.3.7}$$

In (10.3.7) one could also take $\psi$ to be a spinor. From (10.3.6) it is clear that if $\psi$
is a solution of (10.3.7), then so is $P_{jk}\psi$, since

$$\frac{\partial}{\partial t}P_{jk}\psi = P_{jk}\frac{\partial\psi}{\partial t}. \tag{10.3.8}$$

That is, after the exchange of the $j$th and $k$th particle $P_{jk}\psi$ represents the new
state of the system. (10.3.4) requires that

$$P_{jk}\psi = \lambda\psi \quad \text{for some } \lambda \in \mathbb{C} \text{ with } |\lambda| = 1. \tag{10.3.9}$$

As $P_{jk}^2 = 1$ we deduce that $\lambda^2 = 1$, or $\lambda = \pm 1$. That is, the wave functions of
indistinguishable particles are either symmetric ($\lambda = 1$) or antisymmetric ($\lambda = -1$).
Particles with symmetric wave functions are called bosons (or Bose-Einstein
particles, after Satyendranath Bose and Albert Einstein), and particles with
antisymmetric wave functions are called fermions (or Fermi-Dirac particles, after
Enrico Fermi and Paul Dirac).

Pauli’s exclusion principle, in a more general form, is the assertion:

The wave function for a system of identical particles having
half-odd-integral intrinsic spin (for example a multi-electron
system) must be antisymmetric with respect to the interchange
of any two of its particles’ coordinates. (10.3.10)

Here “coordinates” include spin parameters occurring in the wave function.
Experiments have established that particles with half-odd-integral spin are indeed
described by anti-symmetric wave functions, and moreover particles with integral
spin are described by symmetric wave functions.

Consider, for example, the Hamiltonian $H$ in (10.2.12) for helium-like atoms.
If $f$ is a function of six variables $(x^{(1)}, x^{(2)}) = (x_1, y_1, z_1, x_2, y_2, z_2)$, then by
direct computation

$$\nabla_1^2 P_{12}f = P_{12}\nabla_2^2 f \Longrightarrow \nabla_2^2 P_{12}f = P_{12}\nabla_1^2 f, \tag{10.3.11}$$

where the implication in (10.3.11) follows since $P_{12}^2 = 1$. Clearly $P_{12}V = V$ for
$V$ in (10.2.4). Thus indeed

$$HP_{12} = P_{12}H; \tag{10.3.12}$$

i.e., $H$ satisfies (10.3.6). Also

$$H_0P_{12} = P_{12}H_0. \tag{10.3.13}$$

As in (10.2.9), (10.2.10) we have a solution

$$\psi_{12} \stackrel{\text{def}}{=} \psi_{n_1l_1m_1} \otimes \psi_{n_2l_2m_2} \tag{10.3.14}$$

of (10.2.5) (which is the equation $H_0\psi = E\psi$) for

$$E = E_{n_1} + E_{n_2}; \tag{10.3.15}$$

see (10.2.8). By (10.3.13)

$$\psi_{21} = P_{12}\psi_{12} \tag{10.3.16}$$

is also a solution of (10.2.5) for the same $E$ as in (10.3.15). Of course the solutions
$\psi_{12}$, $\psi_{21}$ are not anti-symmetric, but an anti-symmetric solution $\psi$ can
immediately be constructed from them:

$$\psi = \psi_{12} - \psi_{21}, \quad \text{i.e.,} \quad \psi(x^{(1)}, x^{(2)}) = \psi_{n_1l_1m_1}(x^{(1)})\psi_{n_2l_2m_2}(x^{(2)}) - \psi_{n_1l_1m_1}(x^{(2)})\psi_{n_2l_2m_2}(x^{(1)}). \tag{10.3.17}$$

Now if $(n_1, l_1, m_1) = (n_2, l_2, m_2)$, then by (10.3.17) we see that $\psi = 0$. That is, if
the anti-symmetric wave function $\psi$ is non-vanishing, then the states $(n_1, l_1, m_1)$,
$(n_2, l_2, m_2)$ of the two electrons must differ: we must have $(n_1, l_1, m_1) \neq (n_2, l_2, m_2)$.
Thus, disregarding spin considerations (for simplicity), we have presented
an indication of why an anti-symmetry version of Pauli’s exclusion (statement
(10.3.10)) implies the version (10.3.1).
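The vanishing of the antisymmetrized product when both one-particle states coincide can be illustrated with a small Python sketch. The one-dimensional functions `orbital(k)` below are hypothetical stand-ins for the states $\psi_{nlm}$, chosen only to make the example runnable; they are not the hydrogen-like wave functions of the text.

```python
import math

def orbital(k):
    """Hypothetical one-particle state labelled by k (a stand-in for (n, l, m))."""
    return lambda x: x**k * math.exp(-x * x)

def product_state(k1, k2):
    """psi_12(x1, x2) = phi_k1(x1) * phi_k2(x2), in the spirit of (10.3.14)."""
    f, g = orbital(k1), orbital(k2)
    return lambda x1, x2: f(x1) * g(x2)

def exchange(psi):
    """The exchange operator P_12: (P_12 psi)(x1, x2) = psi(x2, x1)."""
    return lambda x1, x2: psi(x2, x1)

def antisymmetrize(psi):
    """psi - P_12 psi, in the spirit of (10.3.17)."""
    swapped = exchange(psi)
    return lambda x1, x2: psi(x1, x2) - swapped(x1, x2)

points = [(0.3, 1.1), (-0.7, 0.4), (1.5, -0.2)]
psi_distinct = antisymmetrize(product_state(0, 1))   # two different labels
psi_same = antisymmetrize(product_state(1, 1))       # identical labels
```

Evaluating `psi_same` at any of the sample points gives zero, while `psi_distinct` changes sign under `exchange`, which is the two-particle content of the argument above.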

One can view the (somewhat mysterious) fact that nature allows only for symmetric
or anti-symmetric wave functions in the description of indistinguishable
particles as being related to the mathematical fact that the symmetric group $S_n$ on
$n$ letters has exactly two one-dimensional representations $\pi^+$, $\pi^-$ (up to equivalence),
where $\pi^+$ is the trivial representation and where $\pi^-$ is the sign representation
which is trivial on even permutations, but acts as $-1$ on odd permutations. Recall
the discussion in Section 7.11 of Chapter 7. First note that since each $\sigma \in S_n$ is a
product of transpositions one has that (10.3.6) holds for every $\sigma \in S_n$:

$$H\sigma = \sigma H \tag{10.3.18}$$

where

$$(\sigma f)(x_1, x_2, \dots, x_n) = f(x_{\sigma^{-1}(1)}, x_{\sigma^{-1}(2)}, \dots, x_{\sigma^{-1}(n)}). \tag{10.3.19}$$

Hence (by (10.3.8) again) if $\psi$ is a (non-zero) solution of (10.3.7), so is $\sigma\psi$
and equation (10.3.9) (which followed by the principle of identity (10.3.4)) now
becomes

$$\sigma\psi = \lambda_\sigma\psi \quad \text{for some } \lambda_\sigma \in \mathbb{C} \text{ with } |\lambda_\sigma| = 1. \tag{10.3.20}$$

By (10.3.19), $(\sigma_1\sigma_2)f = \sigma_1(\sigma_2 f)$, which means that equation (10.3.20) defines a
one-dimensional representation $\pi$ of $S_n$ on the space $\mathbb{C}\psi$ spanned by $\psi$:
$\pi(\sigma)\psi \stackrel{\text{def}}{=} \sigma\psi$. By the above remark, $\pi = \pi^+$ or $\pi^-$. If $\pi = \pi^+$, then $\psi = \sigma\psi$ $\forall \sigma \in S_n$
$\Longrightarrow$ $\psi$ is symmetric. If $\pi = \pi^-$, then in particular as every transposition
is an odd permutation we have (by definition of $\pi^-$) $-\psi = P_{ij}\psi$ $\Longrightarrow$ $\psi$ is
anti-symmetric. Thus the one-dimensional trivial representation of the symmetric
group $S_n$ corresponds in some sense to bosonic particles of nature, whereas the
one-dimensional sign representation corresponds to fermionic particles.
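As a sanity check on the sign representation, one can enumerate a small symmetric group and confirm that the sign is multiplicative and equals $-1$ on every transposition. The sketch below is only an illustration for $n = 4$; the helper names and the encoding of a permutation as a tuple on $\{0, \dots, n-1\}$ are choices made here.

```python
from itertools import permutations

def compose(s, t):
    """Composite permutation i -> s[t[i]] on {0, ..., n-1}."""
    return tuple(s[t[i]] for i in range(len(t)))

def sign(s):
    """Sign of a permutation, computed from its inversion count."""
    n = len(s)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if s[i] > s[j])
    return -1 if inversions % 2 else 1

def is_transposition(s):
    """True if s moves exactly two points."""
    return sum(1 for i, si in enumerate(s) if i != si) == 2

n = 4
group = list(permutations(range(n)))

# The sign is a homomorphism into {+1, -1}, i.e. a one-dimensional representation.
sign_is_multiplicative = all(
    sign(compose(s, t)) == sign(s) * sign(t) for s in group for t in group
)

# Every transposition is odd, so the sign representation sends it to -1.
transposition_signs = {sign(s) for s in group if is_transposition(s)}
```

This does not prove that the trivial and sign representations are the only one-dimensional ones; it only confirms the two properties of the sign used in the argument.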
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Appendix lOA On the Integral in (10.2.19) 

As was shown in Section 10.2 the first-order perturbative analysis of the ground
state energy of a helium-like atom depends mainly on the computation of the
integral $I$ in (10.2.16) (or in (10.2.19)). There are various known methods of
computing $I$, including one which uses an infinite series expansion of $1/r_{12}$ in terms
of spherical harmonics. Although the latter method is elegant, we shall indicate
how the computation of $I$ follows by some well-known Fourier transform
computations. Since these Fourier transforms are known in an arbitrary dimension $n$
we will in fact compute, more generally, an $n$-dimensional version of $I$, for the
record. Namely, let

$$I_n = \int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{-a(|x|+|y|)}}{|x-y|^p}\,dx\,dy \quad \text{for } a > 0,\ 0 < p < n. \tag{10.A.1}$$

Then we can show 




Thus we can focus on Theorem 10.1. Its proof will be deferred for now. See the 
General Appendix D following Chapter 18. 
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Fresnel Integrals and 
Feynman Integrals 



In Part II of this book a transition is made as certain selected topics are consid- 
ered. Among these are path integrals, zeta functions, and some applications of 
the Selberg trace formula. In some cases it will not be possible to develop all of 
the relevant background material. Rather, the intent will be to present some of 
the key ideas and, as in Part I, to present some concrete computational examples. 
Later we shall compute, for example, certain Feynman path integrals. For such 
computations one needs the value of certain Fresnel type integrals, integrals of 
independent interest. We begin with a discussion of the latter integrals. 



11.1 Fresnel Integrals 

The basic Fresnel integrals are the integrals

$$\int_0^\infty \sin x^2\,dx, \qquad \int_0^\infty \cos x^2\,dx. \tag{11.1.1}$$

We review the computation of these integrals, i.e., of the integral

$$\int_0^\infty \exp(i\lambda x^2)\,dx, \quad \lambda > 0, \tag{11.1.2}$$

and then show how to compute a much more complicated type of Fresnel integral
of the form

$$\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} \exp\Big(i\big\{\lambda\big[(x_1-a)^2 + (x_2-x_1)^2 + (x_3-x_2)^2 + \cdots + (x_n-x_{n-1})^2 + (b-x_n)^2\big] + \mu(x_1+\cdots+x_n)\big\}\Big)\,dx_1dx_2\cdots dx_n \tag{11.1.3}$$

for $\lambda > 0$, $a, b, \mu \in \mathbb{R}$, $n \geq 1$ an integer; see Theorem 11.3.
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Figure 11.1: Closed contour 



Let $f$ be the holomorphic function $z \mapsto \exp(i\lambda z^2)$ for $\lambda > 0$ fixed. Let $C = C_1 + C_2 + C_3$
be the closed contour of Figure 11.1, where $C_2$ is a portion of the circle
about the origin of radius $r$. Parameterize $C_1$ by $x(t) = t$, $y(t) = 0$, $0 \leq t \leq r$,
and parameterize $-C_3$ by $x(t) = t$, $y(t) = t$, $0 \leq t \leq r\cos(\pi/4) = r/\sqrt{2}$. Then
$z(t) = x(t) + iy(t) = t$, $z'(t) = 1$ on $C_1$ and similarly $z(t) = t + it$, $z'(t) = 1 + i$ on
$-C_3$ $\Longrightarrow$

$$\int_{C_1} f(z)\,dz = \int_0^r \exp(i\lambda t^2)\,dt, \tag{11.1.4}$$

$$\int_{C_3} f(z)\,dz = -\int_{-C_3} f(z)\,dz = -\int_0^{r/\sqrt{2}} \exp(i\lambda t^2(1+i)^2)(1+i)\,dt = -(1+i)\int_0^{r/\sqrt{2}} \exp(-2\lambda t^2)\,dt. \tag{11.1.5}$$

By Cauchy’s theorem

$$0 = \int_C f(z)\,dz = \int_0^r \exp(i\lambda t^2)\,dt + \int_{C_2} f(z)\,dz - (1+i)\int_0^{r/\sqrt{2}} \exp(-2\lambda t^2)\,dt. \tag{11.1.6}$$

We will show that (for $\lambda > 0$)

$$\lim_{r\to\infty}\int_{C_2} f(z)\,dz = 0. \tag{11.1.7}$$

Assuming (11.1.7) for now we see that $\int_0^\infty \exp(i\lambda t^2)\,dt$ converges and has the
value

$$(1+i)\int_0^\infty \exp(-2\lambda t^2)\,dt = \frac{(1+i)}{\sqrt{2\lambda}}\int_0^\infty \exp(-x^2)\,dx = \frac{(1+i)}{\sqrt{2\lambda}}\frac{\sqrt{\pi}}{2} = \frac{\exp(i\pi/4)}{2}\sqrt{\frac{\pi}{\lambda}}. \tag{11.1.8}$$

Figure 11.2: A lower bound for the sine



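Parameterize $C_2$ by $x(\theta) = r\cos\theta$, $y(\theta) = r\sin\theta$, $0 \leq \theta \leq \pi/4$. Then

$$\int_{C_2} f(z)\,dz = \int_0^{\pi/4} \exp(i\lambda r^2\exp(i2\theta)) \cdot ir\exp(i\theta)\,d\theta \tag{11.1.9}$$

$\Longrightarrow$

$$\Big|\int_{C_2} f(z)\,dz\Big| \leq \int_0^{\pi/4} \exp(\mathrm{Re}(i\lambda r^2\exp(i2\theta))) \cdot r\,d\theta = r\int_0^{\pi/4} \exp(-\mathrm{Im}(\lambda r^2\exp(i2\theta)))\,d\theta = r\int_0^{\pi/4} \exp(-\lambda r^2\sin(2\theta))\,d\theta. \tag{11.1.10}$$

From Figure 11.2 we see that $\sin x \geq 2x/\pi$ for $0 \leq x \leq \pi/2$. Therefore for
$0 \leq \theta \leq \pi/4$, $\sin(2\theta) \geq 4\theta/\pi$ $\Longrightarrow$ $-\lambda r^2\sin(2\theta) \leq -\lambda r^2 4\theta/\pi$ (for $\lambda > 0$) $\Longrightarrow$

$$r\int_0^{\pi/4} \exp(-\lambda r^2\sin(2\theta))\,d\theta \leq r\int_0^{\pi/4} \exp(-\lambda r^2 4\theta/\pi)\,d\theta = [1 - \exp(-\lambda r^2)]\frac{\pi}{4\lambda r} \to 0 \text{ as } r \to \infty, \text{ for } \lambda > 0. \tag{11.1.11}$$

Thus by (11.1.10), equation (11.1.7) is established, and by (11.1.8) we have
proved

Theorem 11.1. For $\lambda > 0$,

$$\int_0^\infty \exp(i\lambda x^2)\,dx \text{ converges}$$

and has the value

$$\frac{1}{2}(1+i)\sqrt{\frac{\pi}{2\lambda}} = \frac{\exp(i\pi/4)}{2}\sqrt{\frac{\pi}{\lambda}}.$$

In particular the Fresnel integrals

$$\int_0^\infty \cos(\lambda x^2)\,dx, \qquad \int_0^\infty \sin(\lambda x^2)\,dx \text{ exist}$$

and have the common value

$$\frac{1}{2}\sqrt{\frac{\pi}{2\lambda}}.$$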
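The value in Theorem 11.1 can be checked numerically, with some care because the integrand does not decay. The sketch below is one possible approach, not the method of the text: it inserts a damping factor $\exp(-\epsilon x^2)$ (an assumption made here so that a plain trapezoid rule applies), compares the result with the closed form of the damped Gaussian integral, and then compares that closed form with the $\epsilon \to 0$ value stated in the theorem. The cutoff and step count are ad hoc choices.

```python
import cmath
import math

def damped_fresnel(lam, eps, upper, steps):
    """Trapezoid approximation of the integral of exp((i*lam - eps) x^2) over [0, upper]."""
    h = upper / steps
    total = 0.5 * (1.0 + cmath.exp((1j * lam - eps) * upper * upper))
    for k in range(1, steps):
        x = k * h
        total += cmath.exp((1j * lam - eps) * x * x)
    return total * h

lam, eps = 1.0, 0.01
numeric = damped_fresnel(lam, eps, upper=40.0, steps=40000)

# Closed form of the damped integral (a standard complex Gaussian integral;
# the principal square root has positive real part, as required).
damped_exact = 0.5 * cmath.sqrt(math.pi / (eps - 1j * lam))

# Value asserted in Theorem 11.1, i.e. the eps -> 0 limit.
theorem_value = 0.5 * cmath.exp(1j * math.pi / 4) * math.sqrt(math.pi / lam)
```

The numerical integral should agree closely with `damped_exact`, and `damped_exact` differs from `theorem_value` only by a term of order `eps`. The real and imaginary parts of `theorem_value` are the two Fresnel integrals, which is why they share a common value.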



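To make further progress we consider the process of uncompleting the square.
Let $q(z) = -\alpha z^2 + \beta z + \gamma$ be a second degree polynomial function where $\alpha, \beta, \gamma \in \mathbb{C}$,
$\alpha \neq 0$. Let $z^*$ be the point where $q'(z)$ vanishes: $q'(z) = -2\alpha z + \beta$ $\Longrightarrow$
$z^* = \beta/2\alpha$. Note that $q(z^*) = -\alpha(\beta/2\alpha)^2 + \beta(\beta/2\alpha) + \gamma = \beta^2/4\alpha + \gamma$. Then
$\alpha(z-z^*)^2 = \alpha(z^2 - 2z\beta/2\alpha + \beta^2/4\alpha^2) = \alpha z^2 - z\beta + \{q(z^*) - \gamma\} = -q(z) + q(z^*)$; i.e.,

Proposition 11.1.

$$(z - z^*)^2 = \frac{-q(z) + q(z^*)}{\alpha} \tag{11.1.12}$$

or

$$q(z) = q(z^*) - \alpha(z - z^*)^2 \tag{11.1.13}$$

for

$$q(z) \stackrel{\text{def}}{=} -\alpha z^2 + \beta z + \gamma, \tag{11.1.14}$$

$$z^* \stackrel{\text{def}}{=} \frac{\beta}{2\alpha}, \quad \alpha \neq 0; \tag{11.1.15}$$

here $\alpha, \beta, \gamma \in \mathbb{C}$.

In equation (11.1.13) of Proposition 11.1 we have “uncompleted the square.”
As an example, for $x, a, b \in \mathbb{R}$, $\lambda_1, \lambda_2 > 0$, $-\gamma \stackrel{\text{def}}{=} \lambda_1a^2 + \lambda_2b^2$, let

$$q(x) = -(\lambda_1 + \lambda_2)x^2 + 2(a\lambda_1 + b\lambda_2)x + \gamma = -\lambda_1(x-a)^2 - \lambda_2(b-x)^2 = -\alpha x^2 + \beta x + \gamma \tag{11.1.16}$$

for $\alpha = \lambda_1 + \lambda_2$, $\beta = 2(a\lambda_1 + b\lambda_2)$. Then

$$x^* \stackrel{\text{def}}{=} \frac{\beta}{2\alpha} = \frac{a\lambda_1 + b\lambda_2}{\lambda_1 + \lambda_2}. \tag{11.1.17}$$

By (11.1.13), (11.1.15), and (11.1.16)

$$\lambda_1(x-a)^2 + \lambda_2(b-x)^2 = -q(x) = -q(x^*) + \alpha(x-x^*)^2 = -q(x^*) + (\lambda_1+\lambda_2)(x-x^*)^2, \tag{11.1.18}$$

where

$$q(x^*) \stackrel{\text{def}}{=} \frac{\beta^2}{4\alpha} + \gamma = \frac{4(a\lambda_1 + b\lambda_2)^2}{4(\lambda_1+\lambda_2)} - \lambda_1a^2 - \lambda_2b^2 = \frac{-\lambda_1\lambda_2(a-b)^2}{\lambda_1+\lambda_2}. \tag{11.1.19}$$

Therefore

$$\int_{-\infty}^{\infty} \exp(i[\lambda_1(x-a)^2 + \lambda_2(b-x)^2])\,dx = \exp\Big(i\frac{\lambda_1\lambda_2(a-b)^2}{\lambda_1+\lambda_2}\Big)\int_{-\infty}^{\infty} \exp(i(\lambda_1+\lambda_2)(x-x^*)^2)\,dx. \tag{11.1.20}$$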



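For a continuous function $f$ on $\mathbb{R}$ we take $I = \int_{-\infty}^{\infty} f(x)\,dx$ to mean

$$\int_{-\infty}^{\infty} f(x)\,dx = \int_{-\infty}^{0} f(x)\,dx + \int_0^{\infty} f(x)\,dx \stackrel{\text{def}}{=} \lim_{r\to\infty}\int_{-r}^{0} f(x)\,dx + \lim_{r\to\infty}\int_0^{r} f(x)\,dx \tag{11.1.21}$$

in this section. If both limits in (11.1.21) exist, i.e., if $I$ converges, then of course
$I$ coincides with the Cauchy principal value (CPV):

$$I = \mathrm{CPV}\int_{-\infty}^{\infty} f(x)\,dx = \lim_{r\to\infty}\int_{-r}^{r} f(x)\,dx. \tag{11.1.22}$$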

J_oo 

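Now for $r, \delta \in \mathbb{R}$, $\lambda > 0$

$$\int_{-r}^{0} \exp(i\lambda(x-\delta)^2)\,dx = \int_{-r-\delta}^{-\delta} \exp(i\lambda u^2)\,du = \int_{-r-\delta}^{0} \exp(i\lambda u^2)\,du + \int_0^{-\delta} \exp(i\lambda u^2)\,du$$

$$= \int_0^{r+\delta} \exp(i\lambda t^2)\,dt + \int_0^{-\delta} \exp(i\lambda u^2)\,du \quad (\text{for } t = -u). \tag{11.1.23}$$

By Theorem 11.1 we see that

$$\int_{-\infty}^{0} \exp(i\lambda(x-\delta)^2)\,dx \text{ exists} \tag{11.1.24}$$

and has the value

$$\frac{e^{i\pi/4}}{2}\sqrt{\frac{\pi}{\lambda}} + \int_0^{-\delta} \exp(i\lambda u^2)\,du. \tag{11.1.25}$$

Similarly

$$\int_0^{r} \exp(i\lambda(x-\delta)^2)\,dx = \int_{-\delta}^{r-\delta} \exp(i\lambda u^2)\,du = \int_{-\delta}^{0} \exp(i\lambda x^2)\,dx + \int_0^{r-\delta} \exp(i\lambda x^2)\,dx \tag{11.1.26}$$

$\Longrightarrow$ $\int_0^{\infty} \exp(i\lambda(x-\delta)^2)\,dx$ exists and has the value

$$\int_{-\delta}^{0} \exp(i\lambda x^2)\,dx + \frac{e^{i\pi/4}}{2}\sqrt{\frac{\pi}{\lambda}}. \tag{11.1.27}$$

Expressions (11.1.25), (11.1.27) taken together (the two integrals over finite
intervals cancel) give

Corollary 11.1. For $\delta \in \mathbb{R}$, $\lambda > 0$

$$\int_{-\infty}^{\infty} \exp(i\lambda(x-\delta)^2)\,dx \text{ converges}$$

(see (11.1.21)) and has the value

$$e^{i\pi/4}\sqrt{\frac{\pi}{\lambda}},$$

which is independent of $\delta$.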

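By Corollary 11.1 and equation (11.1.20) we obtain

Corollary 11.2. For $a, b \in \mathbb{R}$, $\lambda_1, \lambda_2 > 0$

$$\int_{-\infty}^{\infty} \exp(i[\lambda_1(x-a)^2 + \lambda_2(b-x)^2])\,dx \text{ converges}$$

(see (11.1.21)) and has the value

$$e^{i\pi/4}\exp\Big(i\frac{\lambda_1\lambda_2(a-b)^2}{\lambda_1+\lambda_2}\Big)\sqrt{\frac{\pi}{\lambda_1+\lambda_2}}.$$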



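Proposition 11.2. For $a, b \in \mathbb{R}$, $a > 0$, $c \in \mathbb{C}$

$$\int_{-\infty}^{\infty} \exp(i[ax^2 + bx + c])\,dx \text{ converges}$$

(see (11.1.21)) and has the value

$$e^{i\pi/4}\sqrt{\frac{\pi}{a}}\exp\Big(-i\Big[\frac{b^2}{4a} - c\Big]\Big).$$

Proof. Define $q(x) = -ax^2 - bx - c$ for $x \in \mathbb{R}$. By Proposition 11.1 $q(x) = q(x^*) - a(x-x^*)^2$
for $x^* = -b/2a$, where $q(x^*) = b^2/4a - c$. Then $i(ax^2+bx+c) = -iq(x) = -i[q(x^*) - a(x-x^*)^2]$ $\Longrightarrow$

$$\int_{-\infty}^{\infty} \exp(i[ax^2 + bx + c])\,dx = \exp(-iq(x^*))\int_{-\infty}^{\infty} \exp(ia(x-x^*)^2)\,dx. \tag{11.1.28}$$

Hence Proposition 11.2 follows by Corollary 11.1. □

The next theorem generalizes Corollary 11.2.

Theorem 11.2. For $a, b, \mu \in \mathbb{R}$, $\lambda_1, \lambda_2 > 0$,

$$\int_{-\infty}^{\infty} \exp(i[\lambda_1(x-a)^2 + \lambda_2(b-x)^2 + \mu x])\,dx \text{ converges}$$

(see (11.1.21)) and has the value

$$e^{i\pi/4}\sqrt{\frac{\pi}{\lambda_1+\lambda_2}}\exp\Big(i\Big\{\frac{-\mu^2}{4(\lambda_1+\lambda_2)} + \frac{\mu(\lambda_1a + \lambda_2b) + \lambda_1\lambda_2(a-b)^2}{\lambda_1+\lambda_2}\Big\}\Big).$$

Proof. $\lambda_1(x-a)^2 + \lambda_2(b-x)^2 + \mu x = (\lambda_1+\lambda_2)x^2 + (\mu - 2\lambda_1a - 2\lambda_2b)x + \lambda_1a^2 + \lambda_2b^2$.
Therefore by Proposition 11.2

$$\int_{-\infty}^{\infty} \exp(i[\lambda_1(x-a)^2 + \lambda_2(b-x)^2 + \mu x])\,dx \text{ converges}$$

and has the value

$$e^{i\pi/4}\sqrt{\frac{\pi}{\lambda_1+\lambda_2}}\exp\Big(-i\Big[\frac{(\mu - 2\lambda_1a - 2\lambda_2b)^2}{4(\lambda_1+\lambda_2)} - (\lambda_1a^2 + \lambda_2b^2)\Big]\Big)$$

which is the same value claimed in the theorem. □

Corollary 11.3. For $\lambda > 0$, $a, b, \mu \in \mathbb{R}$

$$\int_{-\infty}^{\infty} \exp(i\{\lambda[(x-a)^2 + (b-x)^2] + \mu x\})\,dx \text{ converges}$$

and has the value

$$e^{i\pi/4}\sqrt{\frac{\pi}{2\lambda}}\exp\Big(i\Big\{\frac{-\mu^2}{8\lambda} + \frac{\mu(a+b)}{2} + \frac{\lambda(a-b)^2}{2}\Big\}\Big).$$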
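The closed forms in Theorem 11.2 and Corollary 11.3 can be spot-checked numerically. The following sketch is an illustration under stated assumptions: a damping factor $\exp(-\epsilon x^2)$ is added so that a trapezoid rule on a finite interval is meaningful, so agreement with the theorem is only expected up to a term of order $\epsilon$. The parameter values, the function names, and the tolerance are choices made here.

```python
import cmath
import math

def theorem_11_2_value(lam1, lam2, a, b, mu):
    """Closed form asserted in Theorem 11.2."""
    s = lam1 + lam2
    phase = (-mu * mu / 4 + mu * (lam1 * a + lam2 * b) + lam1 * lam2 * (a - b) ** 2) / s
    return cmath.exp(1j * math.pi / 4) * math.sqrt(math.pi / s) * cmath.exp(1j * phase)

def damped_integral(lam1, lam2, a, b, mu, eps, half_width, steps):
    """Trapezoid rule for the integrand of Theorem 11.2 times exp(-eps x^2) on [-L, L]."""
    h = 2.0 * half_width / steps
    total = 0j
    for k in range(steps + 1):
        x = -half_width + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        phase = lam1 * (x - a) ** 2 + lam2 * (b - x) ** 2 + mu * x
        total += weight * cmath.exp(1j * phase - eps * x * x)
    return total * h

params = dict(lam1=1.0, lam2=0.5, a=0.3, b=-0.2, mu=0.7)
numeric = damped_integral(**params, eps=0.01, half_width=45.0, steps=36000)
exact_limit = theorem_11_2_value(**params)

# Corollary 11.3 is the special case lam1 = lam2 = lam (here lam = 2).
corollary_11_3 = theorem_11_2_value(2.0, 2.0, 0.3, -0.2, 0.7)
```

Shrinking `eps` (and enlarging `half_width` and `steps` accordingly) should bring `numeric` closer to `exact_limit`.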



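For $a, b, \mu \in \mathbb{R}$, $\lambda > 0$ we look at

$$J_2 \stackrel{\text{def}}{=} \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \exp\big(i\big\{\lambda\big[(x_1-a)^2 + (x_2-x_1)^2 + (b-x_2)^2\big] + \mu(x_1+x_2)\big\}\big)\,dx_1dx_2$$

$$= \int_{-\infty}^{\infty} \exp(i[\lambda(b-x_2)^2 + \mu x_2])\Big[\int_{-\infty}^{\infty} \exp(i\{\lambda[(x_1-a)^2 + (x_2-x_1)^2] + \mu x_1\})\,dx_1\Big]dx_2 \tag{11.1.29}$$

$$= \int_{-\infty}^{\infty} \exp(i[\lambda(b-x_2)^2 + \mu x_2])\,e^{i\pi/4}\sqrt{\frac{\pi}{2\lambda}}\exp\Big(i\Big\{\frac{-\mu^2}{8\lambda} + \frac{\mu}{2}(a+x_2) + \frac{\lambda}{2}(a-x_2)^2\Big\}\Big)\,dx_2$$

by Corollary 11.3. The latter integral is

$$e^{i\pi/4}\sqrt{\frac{\pi}{2\lambda}}\,e^{-i\mu^2/8\lambda}\,e^{i\mu a/2}\int_{-\infty}^{\infty} \exp\Big(i\Big\{\frac{\lambda}{2}(x_2-a)^2 + \lambda(b-x_2)^2 + \frac{3\mu}{2}x_2\Big\}\Big)\,dx_2$$

$$= e^{i\pi/4}\sqrt{\frac{\pi}{2\lambda}}\,e^{-i\mu^2/8\lambda}\,e^{i\mu a/2}\cdot e^{i\pi/4}\sqrt{\frac{\pi}{\lambda+\lambda/2}}\cdot\exp\Big(i\Big\{\frac{-(3\mu/2)^2}{4(\lambda+\lambda/2)} + \frac{\frac{3\mu}{2}\big(\frac{\lambda}{2}a + \lambda b\big) + \frac{\lambda^2}{2}(a-b)^2}{\lambda+\lambda/2}\Big\}\Big)$$

(by Theorem 11.2). That is,

$$J_2 = e^{i\pi/2}\frac{\pi}{\sqrt{3}\lambda}\exp\Big(i\Big\{\frac{-\mu^2}{2\lambda} + \mu(a+b) + \frac{\lambda}{3}(a-b)^2\Big\}\Big), \tag{11.1.30}$$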



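which is a special case of the following more general result.

Theorem 11.3. For $\lambda > 0$, $a, b, \mu \in \mathbb{R}$, $n \geq 1$ an integer the integral

$$J_n \stackrel{\text{def}}{=} \int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} \exp\Big(i\big\{\lambda\big[(x_1-a)^2 + (x_2-x_1)^2 + (x_3-x_2)^2 + \cdots + (x_n-x_{n-1})^2 + (b-x_n)^2\big] + \mu(x_1+\cdots+x_n)\big\}\Big)\,dx_1dx_2\cdots dx_n$$

converges (see (11.1.21)) and has the value

$$e^{in\pi/4}\sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\exp\Big(i\Big\{\frac{-n(n+1)(n+2)\mu^2}{48\lambda} + \frac{n\mu(a+b)}{2} + \frac{\lambda(a-b)^2}{n+1}\Big\}\Big),$$

where for $n = 1$ we take $x_{n-1}$ to mean $a$.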



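Proof. The result is established for $n = 1$ in Corollary 11.3, and for $n = 2$
in (11.1.30). We proceed by induction:

$$J_{n+1} = \int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} \exp\Big(i\big\{\lambda\big[(x_1-a)^2 + (x_2-x_1)^2 + \cdots + (x_n-x_{n-1})^2 + (x_{n+1}-x_n)^2 + (b-x_{n+1})^2\big] + \mu(x_1+\cdots+x_n+x_{n+1})\big\}\Big)\,dx_1\cdots dx_ndx_{n+1}$$

$$= \int_{-\infty}^{\infty} \exp(i\{\lambda(b-x_{n+1})^2 + \mu x_{n+1}\})\Big[\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} \exp\Big(i\big\{\lambda\big[(x_1-a)^2 + (x_2-x_1)^2 + \cdots + (x_n-x_{n-1})^2 + (x_{n+1}-x_n)^2\big] + \mu(x_1+\cdots+x_n)\big\}\Big)\,dx_1\cdots dx_n\Big]dx_{n+1}$$

$$= \int_{-\infty}^{\infty} \exp(i\{\lambda(b-x_{n+1})^2 + \mu x_{n+1}\})\,e^{in\pi/4}\sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\exp\Big(i\Big\{\frac{-n(n+1)(n+2)\mu^2}{48\lambda} + \frac{n}{2}\mu(a+x_{n+1}) + \frac{\lambda(a-x_{n+1})^2}{n+1}\Big\}\Big)\,dx_{n+1} \tag{11.1.31}$$

(by induction)

$$= e^{in\pi/4}\sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\exp\Big(\frac{-in(n+1)(n+2)\mu^2}{48\lambda}\Big)e^{in\mu a/2}\int_{-\infty}^{\infty} \exp\Big(i\Big\{\frac{\lambda}{n+1}(x_{n+1}-a)^2 + \lambda(b-x_{n+1})^2 + \mu\Big(1+\frac{n}{2}\Big)x_{n+1}\Big\}\Big)\,dx_{n+1}$$

$$= e^{in\pi/4}\sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\exp\Big(\frac{-in(n+1)(n+2)\mu^2}{48\lambda}\Big)e^{in\mu a/2}\cdot e^{i\pi/4}\sqrt{\frac{\pi}{\frac{\lambda}{n+1}+\lambda}}\exp\Big(i\Big\{\frac{-\mu^2(1+n/2)^2}{4\big(\frac{\lambda}{n+1}+\lambda\big)} + \frac{\mu(1+n/2)\big(\frac{\lambda}{n+1}a + \lambda b\big) + \frac{\lambda^2}{n+1}(a-b)^2}{\frac{\lambda}{n+1}+\lambda}\Big\}\Big)$$

(by Theorem 11.2). Since $\frac{\lambda}{n+1} + \lambda = \big(\frac{n+2}{n+1}\big)\lambda$, this is

$$e^{i(n+1)\pi/4}\sqrt{\frac{\pi^{n+1}}{(n+2)\lambda^{n+1}}}\exp\Big(i\Big\{\frac{-n(n+1)(n+2)\mu^2}{48\lambda} - \frac{\mu^2(n+1)(n+2)}{16\lambda} + \frac{n\mu a}{2} + \frac{\mu a}{2} + \frac{\mu(n+1)b}{2} + \frac{\lambda(a-b)^2}{n+2}\Big\}\Big)$$

$$= e^{i(n+1)\pi/4}\sqrt{\frac{\pi^{n+1}}{(n+2)\lambda^{n+1}}}\exp\Big(i\Big\{\frac{-(n+1)(n+2)(n+3)\mu^2}{48\lambda} + \frac{\mu}{2}(n+1)(a+b) + \frac{\lambda(a-b)^2}{n+2}\Big\}\Big), \tag{11.1.32}$$

which completes the induction.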
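The bookkeeping in the induction step can be checked with exact rational arithmetic. The sketch below tracks only the coefficients appearing in Theorem 11.3, measured in units of $\mu^2/\lambda$, $\mu$, and $\lambda$, and applies the Theorem 11.2 substitution $\lambda_1 = \lambda/(n+1)$, $\lambda_2 = \lambda$, $\mu \to (1+n/2)\mu$. The function names and the coefficient encoding are assumptions made for this illustration.

```python
from fractions import Fraction

def coefficients(n):
    """Coefficients in Theorem 11.3 for J_n.

    Returns (c_mu2, c_mu, c_quad, c_pref), where the exponent is
    i*{c_mu2*mu^2/lam + c_mu*mu*(a+b) + c_quad*lam*(a-b)^2}
    and the modulus of J_n is sqrt(pi^n * c_pref / lam^n).
    """
    return (Fraction(-n * (n + 1) * (n + 2), 48), Fraction(n, 2),
            Fraction(1, n + 1), Fraction(1, n + 1))

def induction_step(n):
    """Apply Theorem 11.2 with lam1 = lam/(n+1), lam2 = lam, mu -> (1 + n/2) mu."""
    c_mu2, c_mu, c_quad, c_pref = coefficients(n)
    lam1, lam2 = c_quad, Fraction(1)      # in units of lam
    nu = 1 + Fraction(n, 2)               # linear coefficient, in units of mu
    s = lam1 + lam2
    new_mu2 = c_mu2 - nu * nu / (4 * s)   # coefficient of mu^2/lam
    coeff_a = c_mu + nu * lam1 / s        # coefficient of mu*a
    coeff_b = nu * lam2 / s               # coefficient of mu*b
    new_quad = lam1 * lam2 / s            # coefficient of lam*(a-b)^2
    new_pref = c_pref / s                 # factor under the square root
    return new_mu2, coeff_a, coeff_b, new_quad, new_pref
```

For each $n$, `induction_step(n)` should reproduce `coefficients(n + 1)`, with the coefficients of $\mu a$ and $\mu b$ both equal to $(n+1)/2$.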
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11.2 Feynman Path Integrals 

In the year 1947 Richard Feynman completed a new formulation of quantum me- 
chanics by means of his path integral. Seeds for this equivalent formulation were 
sown and developed in his 1942 Princeton thesis [31]. The original paper on the 
path integral, the 1948 paper [32], is considered by some to be his finest work. 
Feynman extended his path integral methods to cover quantum electrodynamics, 
quantum statistical mechanics, and other areas [33]. 

The path integral $I = I(t_a, t_b, x_a, x_b)$ in words is a “sum over histories”: a sum
over all paths $x(t)$ of a particle from $x_a = x(t_a)$ to $x_b = x(t_b)$ for a time interval
$[t_a, t_b]$, where each path is weighted by $e^{iS_c(x(t))/\hbar}$, $S_c(x(t))$ being the classical
action for the path. A more analytic expression of such “words” will shortly follow.
A quantum system can be equally well described by specifying its quantum
Hamiltonian $H$ (as we have done heretofore), or by specifying $I$. Physically, the
modulus squared $|I|^2$ determines the probability of arrival of a particle at position
$x_b$ and time $t_b$ from its initial position $x_a$ at time $t_a$. Our purpose here is not to
develop the path integral approach to quantum mechanics. We want to compute
$I$ in three concrete cases and then consider in Chapter 16 a more complicated type
of path integral (a Hawking path integral [45]), whose meaning we should like to
clarify using an appropriate zeta function. Applications to Kaluza-Klein space-times
will be given. It should be pointed out that in contrast to Wiener integrals
(or Euclidean path integrals which we take a look at in Chapter 13), Feynman
integrals are usually plagued by difficult issues of mathematical rigor regarding
their meaning, issues that we shall not attempt to resolve here.

In Schrodinger’s formulation of quantum mechanics the underlying classical 
mechanics is Hamiltonian. Feynman’s formulation has underlying ties to La- 
grangian classical mechanics. We begin our discussion therefore of action inte- 
grals for a path; compare definition (2.7.20). 

Consider a particle of mass $m$ under the influence of a potential $V$, say in one
dimension for simplicity. The motion during a time interval $t_a$ to $t_b$ ($t_a < t_b$) is
represented by a path $x(t)$ from $x_a \stackrel{\text{def}}{=} x(t_a)$ to $x_b \stackrel{\text{def}}{=} x(t_b)$ where $x = x(t)$ is the
equation of motion.

Figure 11.3: Path of particle

The action integral $S(x(t))$ for the path $x(t)$ is defined by

$$S(x(t)) = \int_{t_a}^{t_b}\Big[\frac{m}{2}x'(t)^2 - V(x(t))\Big]\,dt \tag{11.2.1}$$

where the integrand corresponds to the Lagrangian $L$ of (2.1.1). One could think
of $S(x(t))$ as evaluation at $x(t)$ of an action functional $S$ on a space of paths.
We shall have interest, in particular, in the case when $x(t)$ is a straight line. For a
free particle, for example, $V = 0$ and Newton’s equation $0 = ma = m\,\frac{d^2x}{dt^2}$ $\Longrightarrow$
$x = x(t) = ct + d$ is the equation of motion: a straight line. Here $x_a = ct_a + d$,
$x_b = ct_b + d$ $\Longrightarrow$

$$c = \frac{x_b - x_a}{t_b - t_a}, \qquad d = \frac{t_bx_a - t_ax_b}{t_b - t_a}, \tag{11.2.2}$$

so that

$$x(t) = \frac{x_b - x_a}{t_b - t_a}\,t + \frac{t_bx_a - t_ax_b}{t_b - t_a} \tag{11.2.3}$$

is the straight line path from $x_a$ to $x_b$. For such a path the action integral in
(11.2.1) is

$$S(x(t)) = \frac{m}{2}\frac{(x_b - x_a)^2}{t_b - t_a} - \int_{t_a}^{t_b} V(x(t))\,dt \tag{11.2.4}$$

where the second term in (11.2.4) is zero for a free particle.
The Feynman path integral, which we denote by

$$I(t_a, t_b, x_a, x_b) = \int_{x_a}^{x_b} \exp\Big(\frac{i}{\hbar}S(x(t))\Big)\,D[x(t)] \tag{11.2.5}$$

(the dependence on $V$ being suppressed) is formally an integral over the space
of all continuous paths $x(t)$ from $x_a$ to $x_b$ with each such path assigned a weight
$\exp(\frac{i}{\hbar}S(x(t)))$. Here $(t_a, x_a)$, $(t_b, x_b)$ are now fixed points in $\mathbb{R}^2$ with $t_a < t_b$. As
there is no suitable measure $D[x(t)]$ on the space of continuous paths we attempt
to provide $I(t_a, t_b, x_a, x_b)$ a meaning by first taking polygonal paths (which may
be considered as discrete approximations of continuous paths) and then by taking
a suitable limit. We proceed as follows. Given an integer $n \geq 2$ choose a partition
$P_n = \{t_0, t_1, t_2, \dots, t_n\}$ of $[t_a, t_b]$ with each subinterval $[t_j, t_{j+1}]$, $0 \leq j \leq n-1$,
of equal length $\epsilon = \epsilon_n \stackrel{\text{def}}{=} (t_b - t_a)/n$: $t_0 = t_a$, $t_1 = t_a + \epsilon, \dots, t_j = t_a + j\epsilon$,
$0 \leq j \leq n$, $t_n = t_b$. Select real numbers $x_j$ for each $j$, $0 \leq j \leq n$, such that $x_0 = x_a$,
$x_n = x_b$, and connect $(t_j, x_j)$ to $(t_{j+1}, x_{j+1})$ by a straight line path $x_j(t)$ to obtain
a polygonal path $x^{(n)}(t)$ from $(t_a, x_a)$ to $(t_b, x_b)$ as illustrated in Figure 11.4.

[Figure 11.4]

As

$$x_j(t) = x_j + \frac{x_{j+1} - x_j}{t_{j+1} - t_j}(t - t_j) \tag{11.2.6}$$

the action integral of $x_j(t)$ is

$$S(x_j(t)) = \frac{m}{2}\frac{(x_{j+1} - x_j)^2}{t_{j+1} - t_j} - \int_{t_j}^{t_{j+1}} V(x_j(t))\,dt, \tag{11.2.7}$$

similar to (11.2.4). For the full polygonal path

$$S(x^{(n)}(t)) = \sum_{j=0}^{n-1} S(x_j(t)) = \frac{m}{2}\sum_{j=0}^{n-1}\frac{(x_{j+1} - x_j)^2}{t_{j+1} - t_j} - \sum_{j=0}^{n-1}\int_{t_j}^{t_{j+1}} V(x_j(t))\,dt. \tag{11.2.8}$$



Assuming that $V$ is continuous, the mean value theorem for integrals gives

$$\int_{t_j}^{t_{j+1}} V(x_j(t))\,dt = V(x_j(\xi_j))(t_{j+1} - t_j) \quad \text{for some } \xi_j \in (t_j, t_{j+1}). \tag{11.2.9}$$

For large $n$, $t_{j+1} - t_j = \epsilon_n = (t_b - t_a)/n$ is small and consequently $V(x_j(\xi_j))$ is
reasonably approximated by $V(x_j(t_{j+1})) = V(x_{j+1})$: in (11.2.9)

$$\int_{t_j}^{t_{j+1}} V(x_j(t))\,dt \simeq V(x_{j+1})(t_{j+1} - t_j) \quad \text{for large } n. \tag{11.2.10}$$

Of course one could consider other types of approximations as well. Given (11.2.8),
(11.2.10),

$$S(x^{(n)}(t)) \simeq \sum_{j=0}^{n-1}\Big[\frac{m}{2\epsilon_n}(x_{j+1} - x_j)^2 - \epsilon_nV(x_{j+1})\Big]. \tag{11.2.11}$$
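The discretized action on the right-hand side of (11.2.11) is easy to evaluate directly. The following Python sketch is one way to do so, with function names and sample values chosen here for illustration. For a free particle and nodes taken on the straight-line path, the sum reproduces the kinetic term of (11.2.4) for every $n$.

```python
def discretized_action(xs, t_a, t_b, m, potential):
    """Right-hand side of (11.2.11) for the polygonal path through xs[0..n]."""
    n = len(xs) - 1
    eps = (t_b - t_a) / n
    return sum(
        m / (2.0 * eps) * (xs[j + 1] - xs[j]) ** 2 - eps * potential(xs[j + 1])
        for j in range(n)
    )

def straight_line_nodes(x_a, x_b, n):
    """Nodes x_j of the straight-line path from x_a to x_b at equally spaced times."""
    return [x_a + (x_b - x_a) * j / n for j in range(n + 1)]

m, t_a, t_b, x_a, x_b = 2.0, 0.5, 2.0, -1.0, 3.0

# Action (11.2.4) of the straight-line path with V = 0.
free_action = m * (x_b - x_a) ** 2 / (2.0 * (t_b - t_a))

# Discretized action for the same path with n = 64 subintervals.
approx = discretized_action(
    straight_line_nodes(x_a, x_b, 64), t_a, t_b, m, lambda x: 0.0
)
```

For a nonzero potential the right-endpoint rule in (11.2.10) introduces an error of order $\epsilon_n$, so agreement with the exact action is then only approximate.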
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At this point, engaging in a bit of pseudo-mathematics perhaps, we consider 
the right-hand side of (11.2.11) as a function of $n - 1$ variables $x_1, x_2, \dots, x_{n-1}$.
Denote this function by $S(x^{(n)}(t))$ and form the Fresnel-type multiple integral

$$I_n(t_a, t_b, x_a, x_b) = c_n\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} \exp\Big(\frac{iS(x^{(n)}(t))}{\hbar}\Big)\,dx_1dx_2\cdots dx_{n-1} \tag{11.2.12}$$

for some suitable constant $c_n \neq 0$. Now define the Feynman path integral $I(t_a, t_b, x_a, x_b)$
in (11.2.5) by

$$I(t_a, t_b, x_a, x_b) \stackrel{\text{def}}{=} \lim_{n\to\infty} I_n(t_a, t_b, x_a, x_b), \tag{11.2.13}$$

provided the limit exists.

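Let’s analyze this definition in the case of a free particle: $V = 0$ $\Longrightarrow$

$$\frac{iS(x^{(n)}(t))}{\hbar} \stackrel{\text{def}}{=} \frac{mi}{2\epsilon_n\hbar}\big[(x_1 - x_a)^2 + (x_2 - x_1)^2 + \cdots + (x_{n-1} - x_{n-2})^2 + (x_b - x_{n-1})^2\big]. \tag{11.2.14}$$

In Theorem 11.3 (applied with $n - 1$ integration variables) choose $\mu = 0$, $\lambda = \frac{m}{2\epsilon_n\hbar}$ and
conclude from (11.2.12), (11.2.14) that

$$I_n(t_a, t_b, x_a, x_b) = c_n\exp\Big(\frac{i(n-1)\pi}{4}\Big)\frac{1}{\sqrt{n}}\Big(\frac{2\epsilon_n\hbar\pi}{m}\Big)^{\frac{n-1}{2}}\exp\Big(\frac{im(x_b - x_a)^2}{2\epsilon_n\hbar n}\Big)$$

$$= c_n\Big[\frac{2\pi i\hbar(t_b - t_a)}{nm}\Big]^{n/2}\Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{im(x_b - x_a)^2}{2\hbar(t_b - t_a)}\Big) \tag{11.2.15}$$

by the choice $i^{1/2} = \exp(i\pi/4)$ and the definition $\epsilon_nn = t_b - t_a$. It is now clear that
if we make the choice

$$c_n \stackrel{\text{def}}{=} \Big[\frac{mn}{2\pi i\hbar(t_b - t_a)}\Big]^{n/2}, \tag{11.2.16}$$

for example, then

$$I_n(t_a, t_b, x_a, x_b) = \Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{im(x_b - x_a)^2}{2\hbar(t_b - t_a)}\Big) \tag{11.2.17}$$

and in particular, by (11.2.13), we obtain

Theorem 11.4. For the normalization constants $c_n$ in (11.2.16), the Feynman path
integral $I(t_a, t_b, x_a, x_b)$ in (11.2.13) for the free particle of mass $m$ is given by

$$I(t_a, t_b, x_a, x_b) = \Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{im(x_b - x_a)^2}{2\hbar(t_b - t_a)}\Big). \tag{11.2.18}$$

Here $i^{1/2} = \exp(i\pi/4)$.

By (11.2.4) we note that equation (11.2.18) can be expressed as

$$I(t_a, t_b, x_a, x_b) = \Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{i}{\hbar}S(x(t))\Big), \tag{11.2.19}$$

where $S(x(t))$ is the action integral of the classical path $x(t)$ from $x_a$ to $x_b$ of the
free particle, with $x(t)$ given by (11.2.3).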
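That the normalization (11.2.16) makes $I_n$ independent of $n$ can be confirmed numerically. The sketch below evaluates the first expression in (11.2.15) with $c_n$ from (11.2.16) and compares it with (11.2.17). Python's principal-branch complex power and square root realize the convention $i^{1/2} = \exp(i\pi/4)$; the function names and sample values are choices made for this illustration.

```python
import cmath
import math

def free_particle_I_n(n, m, hbar, t_a, t_b, x_a, x_b):
    """I_n from (11.2.15) with the normalization c_n of (11.2.16)."""
    T = t_b - t_a
    eps = T / n
    lam = m / (2.0 * eps * hbar)
    # Theorem 11.3 with mu = 0 and n - 1 integration variables.
    J = (cmath.exp(1j * (n - 1) * math.pi / 4)
         * math.sqrt(math.pi ** (n - 1) / (n * lam ** (n - 1)))
         * cmath.exp(1j * lam * (x_b - x_a) ** 2 / n))
    c_n = (n * m / (2j * math.pi * hbar * T)) ** (n / 2)
    return c_n * J

def free_particle_kernel(m, hbar, t_a, t_b, x_a, x_b):
    """Right-hand side of (11.2.17)."""
    T = t_b - t_a
    return cmath.sqrt(m / (2j * math.pi * hbar * T)) * cmath.exp(
        1j * m * (x_b - x_a) ** 2 / (2.0 * hbar * T))

args = dict(m=1.0, hbar=1.0, t_a=0.0, t_b=1.5, x_a=0.2, x_b=1.0)
kernel = free_particle_kernel(**args)
values = {n: free_particle_I_n(n, **args) for n in (2, 5, 12, 30)}
```

Every entry of `values` should coincide with `kernel` up to rounding, so the limit in (11.2.13) is trivial for the free particle.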

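To understand better the meaning of Theorem 11.4 consider Schrödinger’s
time-dependent equation

$$\frac{\partial\psi}{\partial t} = \frac{\hbar i}{2m}\frac{\partial^2\psi}{\partial x^2} \tag{11.2.20}$$

for the free particle. Define a kernel function $G_0$ of $(x, t)$, $x \in \mathbb{R}$, $t > 0$, by

$$G_0(x, t) \stackrel{\text{def}}{=} \sqrt{\frac{m}{2\pi it\hbar}}\exp\Big(\frac{imx^2}{2\hbar t}\Big) = \frac{1}{\sqrt{4\pi ta}}\exp\Big(\frac{-x^2}{4ta}\Big) \quad \text{for } a \stackrel{\text{def}}{=} \frac{i\hbar}{2m},\ i^{1/2} = e^{i\pi/4}. \tag{11.2.21}$$

By direct computation:

Proposition 11.3. $G_0$ is a solution of Schrödinger’s equation in (11.2.20).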
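The “direct computation” behind Proposition 11.3 can be mirrored numerically by checking that the kernel in (11.2.21) nearly annihilates the operator $\partial_t - \frac{i\hbar}{2m}\partial_x^2$. The sketch below uses central finite differences; the step size `h`, the units $m = \hbar = 1$, and the sample points are assumptions made for this illustration only.

```python
import cmath
import math

def G0(x, t, m=1.0, hbar=1.0):
    """Kernel of (11.2.21); the principal square root encodes i^{1/2} = exp(i*pi/4)."""
    return cmath.sqrt(m / (2j * math.pi * hbar * t)) * cmath.exp(
        1j * m * x * x / (2.0 * hbar * t))

def schrodinger_residual(x, t, m=1.0, hbar=1.0, h=1e-4):
    """Central-difference estimate of dG0/dt - (i*hbar/2m) d^2 G0/dx^2."""
    dt = (G0(x, t + h, m, hbar) - G0(x, t - h, m, hbar)) / (2.0 * h)
    dxx = (G0(x + h, t, m, hbar) - 2.0 * G0(x, t, m, hbar)
           + G0(x - h, t, m, hbar)) / (h * h)
    return dt - 1j * hbar / (2.0 * m) * dxx

residual_1 = schrodinger_residual(0.7, 1.3)
residual_2 = schrodinger_residual(-1.2, 0.8)
```

Both residuals should be close to zero, limited only by the finite-difference truncation and rounding errors.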
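We see that the function $G_0$ is the analogue of the heat kernel

$$H_0(x, t) \stackrel{\text{def}}{=} \frac{\exp\big(\frac{-x^2}{4ta}\big)}{\sqrt{4\pi ta}}, \quad x \in \mathbb{R},\ t > 0,\ a > 0 \tag{11.2.22}$$

which solves the heat equation

$$\frac{\partial\psi}{\partial t} = a\frac{\partial^2\psi}{\partial x^2}. \tag{11.2.23}$$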



Now define a Green’s function $G$ by

$$G(x, t; y, s) \stackrel{\text{def}}{=} G_0(x - y, t - s) \quad \text{for } x, y, t, s \in \mathbb{R},\ t > s$$

$$= \Big[\frac{m}{2\pi i\hbar(t - s)}\Big]^{1/2}\exp\Big(\frac{im(x - y)^2}{2\hbar(t - s)}\Big). \tag{11.2.24}$$

Then by (11.2.18) we see that for the free particle

$$I(t_a, t_b, x_a, x_b) = G(x_b, t_b; x_a, t_a), \tag{11.2.25}$$

which relates the Feynman integral to a solution of Schrödinger’s equation. A key
point about the Feynman integral $I$ (i.e., the Green’s function $G$) is that (as can
be shown) it relates a wave function at two different times:

$$\psi(x_b, t_b) = \int_{\mathbb{R}} G(x_b, t_b; x, t_a)\psi(x, t_a)\,dx, \tag{11.2.26}$$

which is Huygens’ principle.

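As a second example, a case with $V \neq 0$, choose $V(x) = mgx$, where $g$ is a
constant:

$$\frac{iS(x^{(n)}(t))}{\hbar} \stackrel{\text{def}}{=} \frac{mi}{2\hbar\epsilon_n}\big[(x_1 - x_a)^2 + (x_2 - x_1)^2 + \cdots + (x_{n-1} - x_{n-2})^2 + (x_b - x_{n-1})^2\big] - \frac{i}{\hbar}\epsilon_nmg(x_1 + \cdots + x_{n-1}) - \frac{i}{\hbar}\epsilon_nmgx_b. \tag{11.2.27}$$

By definition (11.2.12) and Theorem 11.3,

$$I_n(t_a, t_b, x_a, x_b) = c_n\exp\Big(-\frac{i}{\hbar}\epsilon_nmgx_b\Big)\cdot\exp\Big(\frac{i(n-1)\pi}{4}\Big)\sqrt{\frac{\pi^{n-1}}{n\lambda^{n-1}}}\cdot\exp\Big(i\Big\{-\frac{(n-1)n(n+1)}{48\lambda}\mu^2 + \frac{(n-1)\mu}{2}(x_a + x_b) + \frac{\lambda(x_b - x_a)^2}{n}\Big\}\Big) \tag{11.2.28}$$

for $\lambda \stackrel{\text{def}}{=} \frac{m}{2\hbar\epsilon_n}$, $\mu \stackrel{\text{def}}{=} -\frac{\epsilon_nmg}{\hbar}$.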



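Again use that $\epsilon_nn = t_b - t_a$:

$$\frac{(n-1)n(n+1)\mu^2}{48\lambda} = \frac{(n^2-1)n\,2\hbar\epsilon_n^3m^2g^2}{48m\hbar^2} = \frac{(n^2-1)\epsilon_n^2mg^2(t_b - t_a)}{24\hbar}$$

$$= \frac{n^2\epsilon_n^2mg^2(t_b - t_a)}{24\hbar} - \frac{\epsilon_n^2mg^2(t_b - t_a)}{24\hbar} = \frac{mg^2(t_b - t_a)^3}{24\hbar} - \frac{\epsilon_n^2mg^2(t_b - t_a)}{24\hbar}. \tag{11.2.29}$$

Also

$$\frac{(n-1)\mu}{2}(x_a + x_b) \stackrel{\text{def}}{=} -\frac{(n-1)\epsilon_nmg}{2\hbar}(x_a + x_b) = -(t_b - t_a)\frac{mg}{2\hbar}(x_a + x_b) + \frac{\epsilon_nmg(x_a + x_b)}{2\hbar}, \tag{11.2.30}$$

and

$$\frac{\lambda}{n}(x_b - x_a)^2 \stackrel{\text{def}}{=} \frac{m}{2\hbar n\epsilon_n}(x_b - x_a)^2 = \frac{m(x_b - x_a)^2}{2\hbar(t_b - t_a)}. \tag{11.2.31}$$

Thus by (11.2.29), (11.2.30), and (11.2.31)

$$\exp\Big(i\Big\{-\frac{(n-1)n(n+1)}{48\lambda}\mu^2 + \frac{(n-1)\mu}{2}(x_a + x_b) + \frac{\lambda(x_b - x_a)^2}{n}\Big\}\Big)$$

$$= \exp\Big(i\Big\{-\frac{mg^2(t_b - t_a)^3}{24\hbar} - (t_b - t_a)\frac{mg(x_a + x_b)}{2\hbar} + \frac{m(x_b - x_a)^2}{2\hbar(t_b - t_a)}\Big\}\Big)\cdot\exp\Big(i\Big\{\frac{\epsilon_n^2mg^2(t_b - t_a)}{24\hbar} + \frac{\epsilon_nmg(x_a + x_b)}{2\hbar}\Big\}\Big). \tag{11.2.32}$$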



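By definition of $\lambda$, $\epsilon_n$

$$\exp\Big(\frac{i(n-1)\pi}{4}\Big)\sqrt{\frac{\pi^{n-1}}{n\lambda^{n-1}}} = \Big[\frac{2\pi i\hbar(t_b - t_a)}{nm}\Big]^{n/2}\Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2} \tag{11.2.33}$$

for the choice $i^{1/2} = \exp(i\pi/4)$. For the same $c_n$ defined in (11.2.16) we see
by (11.2.28), (11.2.32), (11.2.33) that

$$I_n(t_a, t_b, x_a, x_b) = \exp\Big(\frac{-i\epsilon_nmgx_b}{\hbar}\Big)\Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\cdot\exp\Big(i\Big\{-\frac{mg^2(t_b - t_a)^3}{24\hbar} - (t_b - t_a)\frac{mg(x_a + x_b)}{2\hbar} + \frac{m(x_b - x_a)^2}{2\hbar(t_b - t_a)}\Big\}\Big)$$

$$\cdot\exp\Big(i\Big\{\frac{\epsilon_n^2mg^2(t_b - t_a)}{24\hbar} + \frac{\epsilon_nmg(x_a + x_b)}{2\hbar}\Big\}\Big). \tag{11.2.34}$$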



As $n \to \infty$, $\epsilon_n \to 0$. Hence

Theorem 11.5. For the choice of normalizing constants $c_n$ given by (11.2.16)
and for the potential $V(x) = mgx$, $g$ a constant (such as the acceleration due to
gravity) the Feynman path integral $I(t_a, t_b, x_a, x_b)$ is given by

$$I(t_a, t_b, x_a, x_b) = \Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{i}{\hbar}\Big\{-\frac{mg^2(t_b - t_a)^3}{24} - (t_b - t_a)\frac{mg(x_a + x_b)}{2} + \frac{m(x_b - x_a)^2}{2(t_b - t_a)}\Big\}\Big). \tag{11.2.35}$$

Again we take $i^{1/2} = \exp(i\pi/4)$.

Suppose $g$ is the acceleration due to gravity so that, as discussed in Chapter 2,
a freely falling object of mass $m$ experiences a force $F$ of magnitude $mg$. Label
the vertical axis now by $x$. With $F$ directed toward the origin $V(x) = mgx$ in
Theorem 11.5 is the corresponding potential: $-V'(x) = -mg$. The equation of
motion is $-mg = m\frac{d^2x}{dt^2}$, or

$$x(t) = -\frac{gt^2}{2} + At + B. \tag{11.2.36}$$

Then for $x_a = x(t_a)$, $x_b = x(t_b)$

$$x_b - x_a = -\frac{g}{2}(t_b^2 - t_a^2) + A(t_b - t_a) \tag{11.2.37}$$

$\Longrightarrow$

$$A = \frac{x_b - x_a}{t_b - t_a} + \frac{g}{2}(t_b + t_a). \tag{11.2.38}$$

Similarly

$$t_bx_a = -\frac{gt_a^2t_b}{2} + At_at_b + Bt_b \quad \text{and} \quad t_ax_b = -\frac{gt_b^2t_a}{2} + At_bt_a + Bt_a \tag{11.2.39}$$

$\Longrightarrow$

$$B = \frac{t_ax_b - t_bx_a}{t_a - t_b} - \frac{gt_at_b}{2}. \tag{11.2.40}$$

By definition (11.2.1)

$$S(x(t)) = \int_{t_a}^{t_b}\Big[\frac{m}{2}x'(t)^2 - V(x(t))\Big]\,dt = \int_{t_a}^{t_b}\Big[\frac{m}{2}(-gt + A)^2 - mg\Big(-\frac{gt^2}{2} + At + B\Big)\Big]\,dt$$

$$= \frac{mg^2}{3}(t_b^3 - t_a^3) - mgA(t_b^2 - t_a^2) + \Big(\frac{mA^2}{2} - mgB\Big)(t_b - t_a). \tag{11.2.41}$$

Using the values of $A$, $B$ given in (11.2.38), (11.2.40) one obtains, after some
messy algebra,

Proposition 11.4. For the potential $V(x) = mgx$ with $g$ the acceleration due
to gravity the classical path $x(t)$ from $x_a$ to $x_b$ is given by (11.2.36) with $A$,
$B$ given by (11.2.38), (11.2.40), and the corresponding action integral $S(x(t))$
of (11.2.41) is given by

$$S(x(t)) = \frac{m(x_b - x_a)^2}{2(t_b - t_a)} - \frac{mg}{2}(x_b + x_a)(t_b - t_a) - \frac{mg^2(t_b - t_a)^3}{24}. \tag{11.2.42}$$
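The “messy algebra” leading to (11.2.42) can be cross-checked numerically. The sketch below builds the classical path from (11.2.36), (11.2.38), (11.2.40), integrates the Lagrangian of (11.2.41) with Simpson's rule (which is exact here, since the integrand is a quadratic polynomial in $t$), and compares the result with the closed form. The function names and sample values are chosen for this illustration.

```python
def classical_path_coefficients(t_a, t_b, x_a, x_b, g):
    """A and B of (11.2.38), (11.2.40) for x(t) = -g t^2/2 + A t + B."""
    A = (x_b - x_a) / (t_b - t_a) + 0.5 * g * (t_b + t_a)
    B = (t_a * x_b - t_b * x_a) / (t_a - t_b) - 0.5 * g * t_a * t_b
    return A, B

def action_by_quadrature(t_a, t_b, x_a, x_b, m, g, steps=2000):
    """Simpson's rule (steps must be even) applied to the integrand of (11.2.41)."""
    A, B = classical_path_coefficients(t_a, t_b, x_a, x_b, g)

    def lagrangian(t):
        velocity = -g * t + A
        position = -0.5 * g * t * t + A * t + B
        return 0.5 * m * velocity ** 2 - m * g * position

    h = (t_b - t_a) / steps
    total = lagrangian(t_a) + lagrangian(t_b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * lagrangian(t_a + k * h)
    return total * h / 3.0

def action_closed_form(t_a, t_b, x_a, x_b, m, g):
    """Right-hand side of (11.2.42)."""
    T = t_b - t_a
    return (m * (x_b - x_a) ** 2 / (2.0 * T)
            - 0.5 * m * g * (x_b + x_a) * T
            - m * g * g * T ** 3 / 24.0)

sample = dict(t_a=0.5, t_b=2.0, x_a=1.0, x_b=-0.5, m=2.0, g=9.8)
numeric_action = action_by_quadrature(**sample)
exact_action = action_closed_form(**sample)
```

The two values should agree to rounding error, and the coefficients returned by `classical_path_coefficients` reproduce the prescribed endpoints $x(t_a) = x_a$, $x(t_b) = x_b$.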



Given (11.2.42) and Theorem 11.5 we can express the path integral $I(t_a, t_b, x_a, x_b)$
in (11.2.35) by

$$I(t_a, t_b, x_a, x_b) = \Big[\frac{m}{2\pi i\hbar(t_b - t_a)}\Big]^{1/2}\exp\Big(\frac{i}{\hbar}S(x(t))\Big). \tag{11.2.43}$$

This result is similar to the result (11.2.19) obtained for the free particle.

Appendix 11A Gaussian Integrals

We shall also need the following Gaussian version of Theorem 11.3.
Theorem 11.6. For n = 1,2,3, ... ,a,b eR, A > 0 



00 00 






-00 -00 



converges and has the value ^ ^ • For n = \, x„_i a. 

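Proof. The starting point is the fact that

$$\int_{-\infty}^{\infty} e^{-\lambda x^2}\,dx = \sqrt{\frac{\pi}{\lambda}} \quad\text{or}\quad \int_{-\infty}^{\infty} e^{-\lambda (x-a)^2}\,dx = \sqrt{\frac{\pi}{\lambda}} \tag{11.A.1}$$

for $a \in \mathbb{R}$, by the translational invariance of Lebesgue measure $dx$; cf. Corollary 11.1.
By equations (11.A.1), (11.1.18), (11.1.19)

$$\lambda_1(x - a)^2 + \lambda_2(b - x)^2 = -q(x^*) + (\lambda_1 + \lambda_2)(x - x^*)^2 \tag{11.A.2}$$

$$\Longrightarrow\quad \int_{-\infty}^{\infty} e^{-[\lambda_1(x-a)^2 + \lambda_2(b-x)^2]}\,dx = e^{q(x^*)}\int_{-\infty}^{\infty} e^{-(\lambda_1+\lambda_2)(x-x^*)^2}\,dx = \sqrt{\frac{\pi}{\lambda_1+\lambda_2}}\; e^{-\frac{\lambda_1\lambda_2 (a-b)^2}{\lambda_1+\lambda_2}} \tag{11.A.3}$$

for $\lambda_1, \lambda_2 > 0$. In particular for $\lambda_1 = \lambda_2 = \lambda > 0$

$$\int_{-\infty}^{\infty} e^{-\lambda[(x-a)^2 + (b-x)^2]}\,dx = \sqrt{\frac{\pi}{2\lambda}}\; e^{-\frac{\lambda(a-b)^2}{2}}. \tag{11.A.4}$$

That is, Theorem 11.6 is true for $n = 1$; we proceed by induction.

$$I_{n+1} = \int_{-\infty}^{\infty}\left[\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty} e^{-\lambda\left[\sum_{j=1}^{n}(x_j - x_{j-1})^2 + (x_{n+1} - x_n)^2\right]}\,dx_1\cdots dx_n\right] e^{-\lambda(b - x_{n+1})^2}\,dx_{n+1}$$

$$= \sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\int_{-\infty}^{\infty} e^{-\frac{\lambda}{n+1}(a - x_{n+1})^2 - \lambda(b - x_{n+1})^2}\,dx_{n+1} \quad\text{(by induction)}$$

$$= \sqrt{\frac{\pi^n}{(n+1)\lambda^n}}\;\sqrt{\frac{\pi}{\frac{\lambda}{n+1} + \lambda}}\; e^{-\frac{\lambda(a-b)^2}{n+2}} \tag{11.A.5}$$

by (11.A.3). That is,

$$I_{n+1} = \sqrt{\frac{\pi^{n+1}}{(n+1)\lambda^n\left(\frac{n+2}{n+1}\right)\lambda}}\; e^{-\frac{\lambda(a-b)^2}{n+2}} = \sqrt{\frac{\pi^{n+1}}{(n+2)\lambda^{n+1}}}\; e^{-\frac{\lambda(a-b)^2}{n+2}}, \tag{11.A.6}$$

which completes the induction. □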
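As a sanity check on Theorem 11.6, the case $n = 2$ can be tested by brute force. The Python sketch below is an illustration only (the function names, grid size and sample parameters are assumptions, not part of the text): it sums the integrand $\exp(-\lambda[(x_1-a)^2 + (x_2-x_1)^2 + (b-x_2)^2])$ over a uniform square grid, which is very accurate for smooth, rapidly decaying integrands, and compares the result with the closed form $\sqrt{\pi^n/((n+1)\lambda^n)}\,e^{-\lambda(a-b)^2/(n+1)}$.

```python
import math


def chain_gaussian_closed_form(n, lam, a, b):
    """Value asserted by Theorem 11.6 for n integration variables."""
    return (math.sqrt(math.pi ** n / ((n + 1) * lam ** n))
            * math.exp(-lam * (a - b) ** 2 / (n + 1)))


def chain_gaussian_grid_n2(lam, a, b, half_width=8.0, points=321):
    """Uniform-grid estimate of the n = 2 integral over a large square.

    The square is centred at the midpoint of a and b; the integrand is
    negligibly small on its boundary for the parameters used below.
    """
    h = 2.0 * half_width / (points - 1)
    centre = 0.5 * (a + b)
    xs = [centre - half_width + k * h for k in range(points)]
    total = 0.0
    for x1 in xs:
        for x2 in xs:
            total += math.exp(-lam * ((x1 - a) ** 2 + (x2 - x1) ** 2 + (b - x2) ** 2))
    return total * h * h


if __name__ == "__main__":
    lam, a, b = 1.0, 0.3, -0.5   # arbitrary sample values
    print(chain_gaussian_grid_n2(lam, a, b), chain_gaussian_closed_form(2, lam, a, b))
```

The grid parameters are tuned for $\lambda$ of order one; much smaller $\lambda$ would need a wider square, and much larger $\lambda$ a finer grid.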
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The next result generalizes (ll.A.l). 

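Theorem 11.7. Let $A$ be an $n \times n$ real symmetric matrix with positive eigenvalues,
and let $\sigma > 0$. Then

$$\int_{\mathbb{R}^n} e^{-\sigma x A x^t}\,dx = \sqrt{\frac{\pi^n}{\sigma^n \det A}} \tag{11.A.7}$$

for Lebesgue measure $dx$ on $\mathbb{R}^n$, $x$ = a row vector.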

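Proof. As $A$ is symmetric we can choose an orthogonal matrix $B$ such that

$$BAB^{-1} = \begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix} \tag{11.A.8}$$

where the $\lambda_j$ are the eigenvalues of $A$. In general, if $\phi : \mathbb{R}^n \to \mathbb{R}^n$ is a non-singular
linear operator on $\mathbb{R}^n$, then

$$\int_{\mathbb{R}^n} f \circ \phi \; dx = \frac{1}{|\det \phi|}\int_{\mathbb{R}^n} f\,dx \tag{11.A.9}$$

is a familiar property of Lebesgue measure $dx$, for any nice function $f$. Choose

$$\phi(x) = xB^t, \qquad f(x) = \exp\left(-\sigma\, x \begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix} x^t\right) = e^{-\sigma\sum_{j=1}^{n}\lambda_j x_j^2} \quad\text{for } x = (x_1, \ldots, x_n) \in \mathbb{R}^n. \tag{11.A.10}$$

Then $f(\phi(x)) = e^{-\sigma x A x^t}$ and $|\det\phi| = |\det B^t| = 1$ by (11.A.10), since $B$ is
orthogonal. (11.A.9) gives

$$\int_{\mathbb{R}^n} e^{-\sigma x A x^t}\,dx = \int_{\mathbb{R}}\cdots\int_{\mathbb{R}} e^{-\sigma\sum_{j=1}^{n}\lambda_j x_j^2}\,dx_1\cdots dx_n = \prod_{j=1}^{n}\sqrt{\frac{\pi}{\sigma\lambda_j}} = \sqrt{\frac{\pi^n}{\sigma^n\det A}}, \tag{11.A.11}$$

where we have used (11.A.1). □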

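We have noted that (11.A.1) is the Gaussian version of Corollary 11.1 where

$$\int_{-\infty}^{\infty} f(x)\,dx := \lim_{R\to\infty}\int_{0}^{R} f(x)\,dx + \lim_{R\to\infty}\int_{-R}^{0} f(x)\,dx \tag{11.A.12}$$

as in (11.1.21):

$$\int_{-\infty}^{\infty} e^{i\lambda x^2}\,dx = \int_{-\infty}^{\infty} e^{i\lambda (x-b)^2}\,dx = e^{i\pi/4}\sqrt{\frac{\pi}{\lambda}} \tag{11.A.13}$$

for $\lambda > 0$, $b \in \mathbb{R}$. Using (11.A.13) in place of (11.A.1) one can deduce that

$$\int_{\mathbb{R}^n} e^{i\sigma x A x^t}\,dx = e^{i\pi n/4}\sqrt{\frac{\pi^n}{\sigma^n\det A}} \tag{11.A.14}$$

for $\sigma$, $A$ in Theorem 11.7.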
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On the other hand, 



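$$x_b = A\sin(\omega t_b + \phi) = A(\sin\omega t_b\cos\phi + \cos\omega t_b\sin\phi),$$
$$x_a = A\sin(\omega t_a + \phi) = A(\sin\omega t_a\cos\phi + \cos\omega t_a\sin\phi) \tag{12.1.3}$$

$$\Longrightarrow\quad x_b\cos\omega t_a - x_a\cos\omega t_b = (\sin\omega t_b\cos\omega t_a - \sin\omega t_a\cos\omega t_b)\,A\cos\phi = [\sin\omega(t_b - t_a)]\,A\cos\phi \tag{12.1.4}$$

and

$$x_b\sin\omega t_a - x_a\sin\omega t_b = (\cos\omega t_b\sin\omega t_a - \cos\omega t_a\sin\omega t_b)\,A\sin\phi = [\sin\omega(t_a - t_b)]\,A\sin\phi \tag{12.1.5}$$

$$\Longrightarrow\quad A\cos\phi = \frac{x_b\cos\omega t_a - x_a\cos\omega t_b}{\sin\omega(t_b - t_a)} \quad\text{and}\quad A\sin\phi = \frac{x_a\sin\omega t_b - x_b\sin\omega t_a}{\sin\omega(t_b - t_a)}, \quad\text{if } \sin\omega(t_b - t_a) \neq 0. \tag{12.1.6}$$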

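Then using (12.1.3), (12.1.6) and $\sin 2\theta = 2\sin\theta\cos\theta$, one sees that

$$A^2\sin(2\omega t_b + 2\phi) = A^2\, 2\sin(\omega t_b + \phi)\cos(\omega t_b + \phi) = 2Ax_b\cos(\omega t_b + \phi) = 2Ax_b[\cos\omega t_b\cos\phi - \sin\omega t_b\sin\phi]$$

$$= 2x_b\cos\omega t_b\left[\frac{x_b\cos\omega t_a - x_a\cos\omega t_b}{\sin\omega(t_b - t_a)}\right] - 2x_b\sin\omega t_b\left[\frac{x_a\sin\omega t_b - x_b\sin\omega t_a}{\sin\omega(t_b - t_a)}\right]$$

$$= \frac{2x_b^2\cos\omega t_a\cos\omega t_b - 2x_ax_b + 2x_b^2\sin\omega t_b\sin\omega t_a}{\sin\omega(t_b - t_a)}. \tag{12.1.7}$$

Similarly

$$A^2\sin(2\omega t_a + 2\phi) = \frac{-2x_a^2\cos\omega t_b\cos\omega t_a + 2x_ax_b - 2x_a^2\sin\omega t_a\sin\omega t_b}{\sin\omega(t_b - t_a)} \tag{12.1.8}$$

$$\Longrightarrow\quad A^2[\sin(2\omega t_b + 2\phi) - \sin(2\omega t_a + 2\phi)] = \frac{2(x_a^2 + x_b^2)\cos\omega t_a\cos\omega t_b - 4x_ax_b + 2(x_a^2 + x_b^2)\sin\omega t_a\sin\omega t_b}{\sin\omega(t_b - t_a)}$$

$$= \frac{2(x_a^2 + x_b^2)\cos\omega(t_b - t_a) - 4x_ax_b}{\sin\omega(t_b - t_a)}. \tag{12.1.9}$$

Given (12.1.2), (12.1.9) we have thus shown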

for $\sin\omega(t_b - t_a) \neq 0$, the following.

Proposition 12.1. For the simple harmonic oscillator with frequency $\nu = \omega/2\pi$
the action integral $S(x(t))$ of (12.1.2) for its equation of motion $x(t)$ (see (12.1.1))
is given by

$$S(x(t)) = \frac{m\omega}{2}\left[(x_a^2 + x_b^2)\cot\omega(t_b - t_a) - \frac{2x_ax_b}{\sin\omega(t_b - t_a)}\right] \tag{12.1.10}$$

for $\sin\omega(t_b - t_a) \neq 0$.
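Proposition 12.1 can be checked numerically along the same lines as the derivation. The Python sketch below is illustrative only (function names and sample values are assumptions): it builds the classical path from $A\cos\phi$ and $A\sin\phi$ as in (12.1.6), integrates the Lagrangian $\frac{m}{2}x'(t)^2 - \frac{m\omega^2}{2}x(t)^2$ with Simpson's rule, and compares the result with (12.1.10).

```python
import math


def classical_path(omega, ta, tb, xa, xb):
    """Return x(t) and x'(t) for x(t) = A sin(omega t + phi) through the endpoints."""
    s = math.sin(omega * (tb - ta))
    if s == 0.0:
        raise ValueError("sin(omega (tb - ta)) must be nonzero")
    a_cos = (xb * math.cos(omega * ta) - xa * math.cos(omega * tb)) / s  # A cos(phi)
    a_sin = (xa * math.sin(omega * tb) - xb * math.sin(omega * ta)) / s  # A sin(phi)

    def x(t):
        return a_cos * math.sin(omega * t) + a_sin * math.cos(omega * t)

    def v(t):
        return omega * (a_cos * math.cos(omega * t) - a_sin * math.sin(omega * t))

    return x, v


def oscillator_action_numeric(m, omega, ta, tb, xa, xb, steps=2000):
    """Simpson's rule for the action along the classical path; steps must be even."""
    x, v = classical_path(omega, ta, tb, xa, xb)

    def lagrangian(t):
        return 0.5 * m * v(t) ** 2 - 0.5 * m * omega ** 2 * x(t) ** 2

    h = (tb - ta) / steps
    total = lagrangian(ta) + lagrangian(tb)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * lagrangian(ta + k * h)
    return total * h / 3.0


def oscillator_action_closed_form(m, omega, ta, tb, xa, xb):
    """Right-hand side of (12.1.10)."""
    arg = omega * (tb - ta)
    return 0.5 * m * omega * ((xa * xa + xb * xb) / math.tan(arg)
                              - 2.0 * xa * xb / math.sin(arg))


if __name__ == "__main__":
    sample = (0.8, 2.0, 0.1, 1.2, 0.3, -0.7)   # m, omega, ta, tb, xa, xb (arbitrary)
    print(oscillator_action_numeric(*sample), oscillator_action_closed_form(*sample))
```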

Given equations (11.2.19), (11.2.43) one might naturally wonder whether the
path integral for the simple harmonic oscillator also has the form
$c(t_a, t_b, m)\exp\left(\frac{i}{\hbar}S(x(t))\right)$ for $S(x(t))$ given by Proposition 12.1, where $c(t_a, t_b, m)$ is some
constant depending on $t_a$, $t_b$, and $m$, and depending possibly on $\omega$ also. This
indeed is the case, as was first shown by Feynman:

Theorem 12.1. For the simple harmonic oscillator one has for $\sin\omega(t_b - t_a) > 0$

$$I(t_a, t_b, x_a, x_b) = \sqrt{\frac{m\omega}{2\pi i\hbar\sin\omega(t_b - t_a)}}\;\exp\left(\frac{i}{\hbar}S(x(t))\right) \tag{12.1.11}$$

for $S(x(t))$ given by Proposition 12.1. The normalizing constants $c_n$ in (11.2.12)
are again given by (11.2.16).

A direct proof of Theorem 12.1 is long and cumbersome. A simpler, somewhat
elegant, proof exists which is based on the consideration of the deviation $x_d(t)$ of
a path $x(t)$ from the classical path $x_c(t)$ given by (12.1.1):

$$x_d(t) := x(t) - x_c(t) = x(t) - A\sin(\omega t + \phi) \quad\text{for } t_a \leq t \leq t_b. \tag{12.1.12}$$

As we always consider paths from $x_a$ to $x_b$ we have the boundary conditions

$$x_c(t_a) = x_a = x(t_a), \qquad x_c(t_b) = x_b = x(t_b) \tag{12.1.13}$$

$$\Longrightarrow\quad x_d(t_a) = x_d(t_b) = 0. \tag{12.1.14}$$

Of course $x_c(t)$ satisfies

$$x_c''(t) = -\omega^2 x_c(t). \tag{12.1.15}$$

The next proposition is simple, but is a bit remarkable. From it one also sees
why the factor $\exp\left(\frac{i}{\hbar}S(x(t))\right)$ occurs in (12.1.11) (which also occurs in (11.2.19),
(11.2.43), as has been noted); see (12.1.19) below.

Proposition 12.2.

$$S(x(t)) = S(x_c(t)) + S(x_d(t)). \tag{12.1.16}$$
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Proof. By definition 



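$$S(x(t)) = \int_{t_a}^{t_b}\left[\frac{m}{2}\,x'(t)^2 - \frac{m\omega^2}{2}\,x(t)^2\right]dt = \int_{t_a}^{t_b}\left\{\frac{m}{2}\,[x_c'(t) + x_d'(t)]^2 - \frac{m\omega^2}{2}\,[x_c(t) + x_d(t)]^2\right\}dt$$

$$= \int_{t_a}^{t_b}\left[\frac{m}{2}\,x_c'(t)^2 - \frac{m\omega^2}{2}\,x_c(t)^2\right]dt + \int_{t_a}^{t_b}\left[\frac{m}{2}\,x_d'(t)^2 - \frac{m\omega^2}{2}\,x_d(t)^2\right]dt + m\int_{t_a}^{t_b}\left[x_c'(t)x_d'(t) - \omega^2x_c(t)x_d(t)\right]dt \tag{12.1.17}$$

$$= S(x_c(t)) + S(x_d(t)) + mI,$$

where $I$ denotes the latter integral which we shall see is zero. Namely, integration
by parts provides an alternative expression of $I$:

$$I = \left[x_c'(t)x_d(t)\right]_{t_a}^{t_b} - \int_{t_a}^{t_b} x_c''(t)x_d(t)\,dt - \int_{t_a}^{t_b}\omega^2x_c(t)x_d(t)\,dt = 0 \tag{12.1.18}$$

by the boundary conditions (12.1.14) and the equation of motion (12.1.15). □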



Proposition 12.2 means that (as is intuitively clear) when computing the integral
in (11.2.5) over paths from $x_a$ to $x_b$ (a sum over paths) one can factor out the
contribution $\exp\left(\frac{i}{\hbar}S(x_c(t))\right)$ of the classical path and sum over paths of the form
$x_d(t)$ with $x_d(t_a) = x_d(t_b) = 0$:

$$I(t_a, t_b, x_a, x_b) = \exp\left(\frac{i}{\hbar}S(x_c(t))\right)\int_{x(t_a)=0}^{x(t_b)=0}\exp\left(\frac{i}{\hbar}S(x(t))\right)D[x(t)], \tag{12.1.19}$$

which simplifies matters quite a bit. Here we take the integral on the right-hand
side of (12.1.19) to mean the limit of $I_n(t_a, t_b, 0, 0)$ as $n \to \infty$, as in (11.2.13).
Thus we must consider Fresnel-type multiple integrals in the special case $x_a =
x_b = 0$. Again define $\epsilon_n = (t_b - t_a)/n$ for an integer $n \geq 2$. Define the $(n-1) \times
(n-1)$ matrix $A_n$ by

$$A_n := \begin{pmatrix} 2 - \epsilon_n^2\omega^2 & -1 & & & 0 \\ -1 & 2 - \epsilon_n^2\omega^2 & -1 & & \\ & \ddots & \ddots & \ddots & \\ & & -1 & 2 - \epsilon_n^2\omega^2 & -1 \\ 0 & & & -1 & 2 - \epsilon_n^2\omega^2 \end{pmatrix}. \tag{12.1.20}$$



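Recall that $S(x^{(n)}(t))$ defined as a function of $x_1, x_2, \ldots, x_{n-1}$ is given by

$$S(x^{(n)}(t)) = \sum_{j=0}^{n-1}\left[\frac{m}{2\epsilon_n}(x_{j+1} - x_j)^2 - \epsilon_n V(x_{j+1})\right] = \sum_{j=0}^{n-1}\left[\frac{m}{2\epsilon_n}(x_{j+1} - x_j)^2 - \frac{\epsilon_n m\omega^2}{2}\,x_{j+1}^2\right] \tag{12.1.21}$$

with $x_0 = x_a$, $x_n = x_b$.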

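Proposition 12.3. For the boundary conditions $x_a = x_b = 0$ one has

$$[x_1, \ldots, x_{n-1}]\,A_n\begin{bmatrix} x_1 \\ \vdots \\ x_{n-1}\end{bmatrix} = \sum_{j=0}^{n-1}\left[(x_{j+1} - x_j)^2 - \epsilon_n^2\omega^2x_{j+1}^2\right]. \tag{12.1.22}$$

Proof. The left-hand side of the equation is

$$x_1^2(2 - \epsilon_n^2\omega^2) - 2x_1x_2 + \cdots + x_{n-2}^2(2 - \epsilon_n^2\omega^2) - 2x_{n-2}x_{n-1} + x_{n-1}^2(2 - \epsilon_n^2\omega^2). \tag{12.1.23}$$

On the other hand, because $x_0 = x_n = 0$ the right-hand side is

$$\sum_{j=0}^{n-1}(x_{j+1}^2 - 2x_jx_{j+1} + x_j^2) - \epsilon_n^2\omega^2\sum_{j=0}^{n-1}x_{j+1}^2 = x_1^2 + x_2^2 + \cdots + x_{n-1}^2 - 2\sum_{j=1}^{n-2}x_jx_{j+1} + x_1^2 + x_2^2 + \cdots + x_{n-1}^2 - \epsilon_n^2\omega^2(x_1^2 + x_2^2 + \cdots + x_{n-1}^2), \tag{12.1.24}$$

which is the left-hand side, as desired.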



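By (12.1.21), (12.1.22) we can write

$$S(x^{(n)}(t)) = \frac{m}{2\epsilon_n}\,xA_nx^t \quad\text{for } x = (x_1, \ldots, x_{n-1}) \tag{12.1.25}$$

which puts us in position to apply (11.A.14) in Appendix 11A, where we take $\sigma$
there equal to $m/(2\epsilon_n\hbar)$; note that $A_n$ is indeed symmetric; moreover we will have
$\det A_n \neq 0$ at least for $n$ sufficiently large:

$$I_n(t_a, t_b, 0, 0) = c_n\exp\left(\frac{i\pi(n-1)}{4}\right)\left[\frac{\pi\,2\epsilon_n\hbar}{m}\right]^{(n-1)/2}[\det A_n]^{-1/2} = \left[\frac{m}{2\pi\epsilon_n\hbar i}\right]^{1/2}[\det A_n]^{-1/2} \tag{12.1.26}$$

for $c_n$ given as usual. That is,

$$c_n := \left[\frac{mn}{2\pi i\hbar(t_b - t_a)}\right]^{n/2}. \tag{12.1.27}$$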



To complete the proof of Theorem 12.1 we appeal to the following proposition.

Proposition 12.4. For a real number $c$ with $-1 < c < 1$ form the $n \times n$ matrix

$$M_n(c) := \begin{pmatrix} 2c & -1 & & & 0 \\ -1 & 2c & -1 & & \\ & \ddots & \ddots & \ddots & \\ & & -1 & 2c & -1 \\ 0 & & & -1 & 2c \end{pmatrix}. \tag{12.1.28}$$

Write $c = \cos u$. Then we can write

$$\det M_n(c) = \frac{\sin(n+1)u}{\sin u}. \tag{12.1.29}$$



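We assume this result for now. Noting that by definition (12.1.20) we can write
$A_n = M_{n-1}(1 - \epsilon_n^2\omega^2/2)$, we conclude by Proposition 12.4 that

$$\det A_n = \frac{\sin nu_n}{\sin u_n} \quad\text{for}\quad 1 - \frac{\epsilon_n^2\omega^2}{2} = \cos u_n, \tag{12.1.30}$$

which by (12.1.26) gives

$$I_n(t_a, t_b, 0, 0) = \left[\frac{m}{2\pi i\hbar(t_b - t_a)}\right]^{1/2}\left[\frac{\sin nu_n}{n\sin u_n}\right]^{-1/2}. \tag{12.1.31}$$

Now

$$\sin^2u_n = 1 - \cos^2u_n = \epsilon_n^2\omega^2 - \frac{\epsilon_n^4\omega^4}{4} \quad\text{(by definition (12.1.30))} \tag{12.1.32}$$

$$\Longrightarrow\quad n^2\sin^2u_n = (t_b - t_a)^2\omega^2 - \frac{\epsilon_n^2(t_b - t_a)^2\omega^4}{4} \to (t_b - t_a)^2\omega^2 \quad\text{as } n \to \infty. \tag{12.1.33}$$

Of course $0 \leq u_n = \arccos(1 - \epsilon_n^2\omega^2/2) \to \arccos 1 = 0$ as $n \to \infty$. Thus if
$f(x) := \sin x/x$ with $f(0) := 1$, then $f(u_n) \to 1$ so that we may choose a natural
number $N$ such that $n > N \Longrightarrow |f(u_n) - 1| < 1/2$. In particular $n > N \Longrightarrow
|f(u_n)| > 1/2 \Longrightarrow f(u_n) \neq 0$. Since $n^2\sin^2u_n = n^2u_n^2f(u_n)^2$ we see that for
$n > N$, $n^2u_n^2 = n^2\sin^2u_n/f(u_n)^2$. Hence (by (12.1.33)) $n^2u_n^2 \to (t_b - t_a)^2\omega^2 \Longrightarrow
nu_n \to (t_b - t_a)\omega$ (since $u_n \geq 0$) $\Longrightarrow$

$$\left[\frac{\sin nu_n}{n\sin u_n}\right]^{-1/2} \to \left[\frac{\sin((t_b - t_a)\omega)}{(t_b - t_a)\omega}\right]^{-1/2} \quad\text{as } n \to \infty. \tag{12.1.34}$$

Actually $u_n > 0$ since otherwise $u_n = 0 \Longrightarrow \epsilon_n^2\omega^2/2 = 0$ by (12.1.30). That is,
by (12.1.31), (12.1.34)

$$I(t_a, t_b, 0, 0) := \lim_{n\to\infty}I_n(t_a, t_b, 0, 0) = \left[\frac{m\omega}{2\pi i\hbar\sin((t_b - t_a)\omega)}\right]^{1/2} \tag{12.1.35}$$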
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which together with (12.1.19) and Proposition 12.1 proves Theorem 12.1, modulo 
the proof of Proposition 12.4, which we now turn to. 

For $n = 1$, $M_1(c) = 2c = 2\cos u$, whereas

$$\sin(n+1)u/\sin u = 2\sin u\cos u/\sin u = 2\cos u$$

(where $-1 < c < 1 \Longrightarrow \sin u \neq 0$). Proposition 12.4 is also true for $n = 2$:

$$\det M_2(c) = \det\begin{pmatrix} 2c & -1 \\ -1 & 2c \end{pmatrix} = 4c^2 - 1 = 4\cos^2u - 1 \tag{12.1.36}$$

whereas

$$\sin 3u/\sin u = (3\sin u - 4\sin^3u)/\sin u = 3 - 4(1 - \cos^2u) = 4\cos^2u - 1. \tag{12.1.37}$$

On the other hand, the following recursion formula holds:

$$\det M_{n+1}(c) + \det M_{n-1}(c) = 2c\det M_n(c). \tag{12.1.38}$$

Accordingly we may proceed by induction on $n$:

$$\sin(x \pm y) = \sin x\cos y \pm \cos x\sin y$$

$$\Longrightarrow\quad \sin(n+2)u = \sin((n+1)u + u) = \sin(n+1)u\cos u + \cos(n+1)u\sin u, \tag{12.1.39}$$

and

$$\sin nu = \sin((n+1)u - u) = \sin(n+1)u\cos u - \cos(n+1)u\sin u; \tag{12.1.40}$$

i.e., $\cos(n+1)u\sin u = \sin(n+1)u\cos u - \sin nu \Longrightarrow$

$$\sin(n+2)u = 2\sin(n+1)u\cos u - \sin nu. \tag{12.1.41}$$

By (12.1.38), with an induction hypothesis,

$$\det M_{n+1}(c) = 2c\det M_n(c) - \det M_{n-1}(c) = 2\cos u\,\sin(n+1)u/\sin u - \sin nu/\sin u = \sin(n+2)u/\sin u \tag{12.1.42}$$

by (12.1.41), which completes the induction.

Proposition 12.5. Suppose in Proposition 12.4 that $c > 1$. Write $c = \cosh u$. Then
we can write

$$\det M_n(c) = \frac{\sinh(n+1)u}{\sinh u}.$$

The proof is analogous to that of Proposition 12.4 since the hyperbolic functions
satisfy identities analogous to those of the trigonometric functions. Note that
for $c = 1$, $M_n(c)$ is the Cartan matrix of the simple Lie algebra $sl(n+1, \mathbb{C})$ of
rank $n$. One has $\det M_n(1) = n + 1$.
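The determinant formulas of Propositions 12.4 and 12.5, together with the value $\det M_n(1) = n + 1$, are easy to test numerically. The Python sketch below is an illustration under stated assumptions: the function names are hypothetical, and the convention $\det M_0 := 1$ (the empty determinant) is introduced here only to start the recursion (12.1.38). An independent Gaussian-elimination determinant is included so that the recursion itself is also checked.

```python
import math


def det_by_recursion(n, c):
    """det M_n(c) from det M_{k+1} = 2c det M_k - det M_{k-1}, cf. (12.1.38)."""
    prev, cur = 1.0, 2.0 * c          # det M_0 := 1 (assumed convention), det M_1 = 2c
    for _ in range(n - 1):
        prev, cur = cur, 2.0 * c * cur - prev
    return cur


def det_by_elimination(n, c):
    """det M_n(c) by Gaussian elimination with partial pivoting."""
    rows = [[0.0] * n for _ in range(n)]
    for i in range(n):
        rows[i][i] = 2.0 * c
        if i + 1 < n:
            rows[i][i + 1] = rows[i + 1][i] = -1.0
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(rows[r][col]))
        if rows[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            rows[col], rows[pivot] = rows[pivot], rows[col]
            det = -det                 # a row swap flips the sign
        det *= rows[col][col]
        for r in range(col + 1, n):
            factor = rows[r][col] / rows[col][col]
            for k in range(col, n):
                rows[r][k] -= factor * rows[col][k]
    return det


if __name__ == "__main__":
    n, u = 6, 0.7
    print(det_by_recursion(n, math.cos(u)), math.sin((n + 1) * u) / math.sin(u))
    print(det_by_recursion(n, math.cosh(u)), math.sinh((n + 1) * u) / math.sinh(u))
    print(det_by_recursion(n, 1.0), n + 1)
```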
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13.1 Euclidean Path Integrals 

Besides Feynman’s path integral formulation of quantum mechanics (and ex- 
tended formulations of quantum electrodynamics and other areas, as mentioned 
earlier), his path integral formulation of statistical mechanics has also proved to 
be a very useful development. The latter theory however involves Euclidean path 
integrals or Wiener type integrals, which rest on a more steady mathematical foun- 
dation. Compare remarks in Section 14.2 of the next chapter, for example, and 
also in Section 16.1 of Chapter 16. 

The Euclidean path integral is obtained formally from the Feynman path integral
by analytically continuing real time $t$ to imaginary time $-it$. Under this Wick
rotation $t \to -it$ we declare that $dt \to -i\,dt$ and $x'(t) \to i\,x'(t)$. Then the
action functional

$$S(x(t)) \to \int_{t_a}^{t_b}\left[-\frac{m}{2}\,x'(t)^2 - V(x(t))\right](-i)\,dt = iS_E(x(t)), \tag{13.1.1}$$

where

$$S_E(x(t)) := \int_{t_a}^{t_b}\left[\frac{m}{2}\,x'(t)^2 + V(x(t))\right]dt \tag{13.1.2}$$

is the Euclidean action integral of the path $x(t)$; this corresponds to the action
integral of our particle of mass $m$ moving under the influence of the inverted
potential $-V$. By (13.1.1)

$$\exp\left(\frac{i}{\hbar}S(x(t))\right) \to \exp\left(-\frac{1}{\hbar}S_E(x(t))\right) \tag{13.1.3}$$
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and thus, in comparison with (11.2.5), we denote the Euclidean path integral 
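$I_E(t_a, t_b, x_a, x_b)$ (not yet defined) by

$$I_E(t_a, t_b, x_a, x_b) = \int_{x(t_a)=x_a}^{x(t_b)=x_b}\exp\left(-\frac{1}{\hbar}S_E(x(t))\right)D[x(t)]. \tag{13.1.4}$$

We also declare that $t_b - t_a \to -i(t_b - t_a)$ so that in (12.1.27)

$$c_n \to \tilde{c}_n := \left[\frac{mn}{2\pi\hbar(t_b - t_a)}\right]^{n/2}. \tag{13.1.5}$$

Given (13.1.2) we have the reasonable approximation (for large $n$)

$$S_E(x^{(n)}(t)) \simeq \sum_{j=0}^{n-1}\left[\frac{m}{2\epsilon_n}(x_{j+1} - x_j)^2 + \epsilon_nV(x_{j+1})\right] \tag{13.1.6}$$

as in (11.2.11). It is now clear by comparison with (11.2.12), (11.2.13), how to
define $I_E(t_a, t_b, x_a, x_b)$ in (13.1.4). Namely

$$I_E(t_a, t_b, x_a, x_b) := \lim_{n\to\infty}I_{E,n}(t_a, t_b, x_a, x_b), \tag{13.1.7}$$

where for $\tilde{c}_n$ in (13.1.5)

$$I_{E,n}(t_a, t_b, x_a, x_b) := \tilde{c}_n\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\exp\left(-\frac{1}{\hbar}S_E(x^{(n)}(t))\right)dx_1\,dx_2\cdots dx_{n-1} \tag{13.1.8}$$

with $S_E(x^{(n)}(t))$ regarded as a function of $x_1, x_2, \ldots, x_{n-1}$, given by the right-hand
side of (13.1.6).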

As a simple example consider again the free particle of mass $m$: $V = 0$. In this
case we have

Proposition 13.1.

$$I_E(t_a, t_b, x_a, x_b) = \left[\frac{m}{2\pi\hbar(t_b - t_a)}\right]^{1/2}\exp\left(-\frac{m}{2\hbar}\,\frac{(x_a - x_b)^2}{t_b - t_a}\right).$$

This compares with Theorem 11.4.

Proof. The Proposition follows directly from Theorem 11.6, just as Theorem 11.4
followed from Theorem 11.3. □



As a second example, which is non-trivial, consider again the simple harmonic
oscillator with frequency $\nu$ and potential $V(x) = m\omega^2x^2/2$ where $\omega = 2\pi\nu$. To
compute $I_E(t_a, t_b, x_a, x_b)$ we follow exactly the methods of Chapter 12, setting
up the appropriate analogous results. First of all we need a classical path $x_c(t)$
analogous to the path $x(t) = A\sin(\omega t + \phi)$ in (12.1.1) there for $V$. Since the
Euclidean action functional $S_E$ corresponds to the inverted potential $-V$ (as noted
earlier) the corresponding equation of motion $mx''(t) = F = -(-V)' = m\omega^2x(t)$
reduces to $x''(t) = \omega^2x(t)$, which has as a general solution $x(t) = A\sinh(\omega t + \phi)$.
Thus in this section we take

$$x_c(t) := A\sinh(\omega t + \phi) \tag{13.1.9}$$

where $A, \phi \in \mathbb{R}$ are subject to the initial conditions $x_c(t_a) = x_a$, $x_c(t_b) =
x_b$. Using analogous identities like $\cosh^2x + \sinh^2x = \cosh 2x$, $\sinh(x \pm y) =
\sinh x\cosh y \pm \cosh x\sinh y$, $\cosh(x \pm y) = \cosh x\cosh y \pm \sinh x\sinh y$ one has,
comparing the proof of Proposition 12.1,

Proposition 13.2. For the classical path $x_c(t)$ given in (13.1.9), the Euclidean
action integral $S_E(x_c(t))$ in (13.1.2) is given by

$$S_E(x_c(t)) = \frac{m\omega}{2}\left[\frac{(x_a^2 + x_b^2)\cosh\omega(t_b - t_a) - 2x_ax_b}{\sinh\omega(t_b - t_a)}\right]. \tag{13.1.10}$$

Proposition 13.2 compares with Proposition 12.1.

Given the boundary conditions $x_c(t_a) = x_a = x(t_a)$, $x_c(t_b) = x_b = x(t_b)$,
$x_d(t_a) = x_d(t_b) = 0$ (cf. (12.1.13), (12.1.14)) where again we set $x_d(t) := x(t) -
x_c(t)$ for a path $x(t)$ (where $x_c(t)$ will continue to be given by (13.1.9)), and the
equation of motion $x_c''(t) = \omega^2x_c(t)$, one may proceed exactly as in the proof of
Proposition 12.2 to establish

Proposition 13.3.

$$S_E(x(t)) = S_E(x_c(t)) + S_E(x_d(t)). \tag{13.1.11}$$

As in the case of Feynman path integrals, given Proposition 13.3, we can factor
out the contribution of the classical path $x_c(t)$ to the Euclidean path integral
$I_E(t_a, t_b, x_a, x_b)$ and write

$$I_E(t_a, t_b, x_a, x_b) = \exp(-S_E(x_c(t))/\hbar)\int_{\substack{\text{paths } x(t) \text{ s.t.}\\ x(t_a) = x(t_b) = 0}}\exp(-S_E(x(t))/\hbar)\,D[x(t)] \tag{13.1.12}$$

(similar to (12.1.19)) which greatly simplifies matters. That is,

$$I_E(t_a, t_b, x_a, x_b) = \exp(-S_E(x_c(t))/\hbar)\lim_{n\to\infty}I_{E,n}(t_a, t_b, 0, 0) \tag{13.1.13}$$

where for $\tilde{c}_n$ given by (13.1.5)

$$I_{E,n}(t_a, t_b, 0, 0) = \tilde{c}_n\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\exp(-S_E(x^{(n)}(t))/\hbar)\,dx_1\,dx_2\cdots dx_{n-1} \tag{13.1.14}$$

with $S_E(x^{(n)}(t))$ as a function of $x_1, \ldots, x_{n-1}$ given by

$$S_E(x^{(n)}(t)) = \sum_{j=0}^{n-1}\left[\frac{m}{2\epsilon_n}(x_{j+1} - x_j)^2 + \epsilon_nV(x_{j+1})\right] \tag{13.1.15}$$

for $\epsilon_n := \dfrac{t_b - t_a}{n}$, $n \geq 2$, $x_0 = x_a = 0 = x_b = x_n$.

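In place of $A_n$ defined in (12.1.20) we now set

$$\hat{A}_n := \begin{pmatrix} 2 + \epsilon_n^2\omega^2 & -1 & & & 0 \\ -1 & 2 + \epsilon_n^2\omega^2 & -1 & & \\ & \ddots & \ddots & \ddots & \\ & & -1 & 2 + \epsilon_n^2\omega^2 & -1 \\ 0 & & & -1 & 2 + \epsilon_n^2\omega^2 \end{pmatrix}; \tag{13.1.16}$$

$\hat{A}_n$ is an $(n-1) \times (n-1)$ symmetric matrix. The analogue of Proposition 12.3 is
the statement

$$[x_1, \ldots, x_{n-1}]\,\hat{A}_n\begin{bmatrix} x_1 \\ \vdots \\ x_{n-1}\end{bmatrix} = \sum_{j=0}^{n-1}\left[(x_{j+1} - x_j)^2 + \epsilon_n^2\omega^2x_{j+1}^2\right] \tag{13.1.17}$$

given the boundary conditions $x_0 = x_a = 0$, $x_n = x_b = 0$, which allows us
to write

$$S_E(x^{(n)}(t)) = \frac{m}{2\epsilon_n}\,x\hat{A}_nx^t \quad\text{for } x = (x_1, \ldots, x_{n-1}), \tag{13.1.18}$$

and thus to use the result (11.A.7) in Appendix 11A

$$\int_{\mathbb{R}^{n-1}}\exp(-\lambda\,xMx^t)\,dx = \left(\frac{\pi}{\lambda}\right)^{(n-1)/2}\frac{1}{\sqrt{\det M}} \tag{13.1.19}$$

for $\lambda = m/(2\epsilon_n\hbar)$ to conclude that

$$I_{E,n}(t_a, t_b, 0, 0) = \tilde{c}_n\left(\frac{\pi\,2\epsilon_n\hbar}{m}\right)^{(n-1)/2}\frac{1}{\sqrt{\det\hat{A}_n}} = \left[\frac{m}{2\pi\hbar\epsilon_n}\right]^{1/2}\frac{1}{\sqrt{\det\hat{A}_n}}. \tag{13.1.20}$$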

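By definition (13.1.16), $\hat{A}_n = M_{n-1}(1 + \epsilon_n^2\omega^2/2)$ so that by Proposition 12.5 we
can write

$$\det\hat{A}_n = \frac{\sinh nu_n}{\sinh u_n} \quad\text{for}\quad 1 + \frac{\epsilon_n^2\omega^2}{2} = \cosh u_n. \tag{13.1.21}$$

Applying the arguments of Chapter 12, following the statement of (12.1.31), we
deduce that (as $u_n > 0$)

$$\frac{\det\hat{A}_n}{n} = \frac{\sinh nu_n}{n\sinh u_n} \to \frac{\sinh\omega(t_b - t_a)}{\omega(t_b - t_a)} \quad\text{as } n \to \infty \tag{13.1.22}$$

$$\Longrightarrow\quad I_{E,n}(t_a, t_b, 0, 0) \to \left[\frac{m}{2\pi\hbar(t_b - t_a)}\right]^{1/2}\left[\frac{\omega(t_b - t_a)}{\sinh\omega(t_b - t_a)}\right]^{1/2} = \left[\frac{m\omega}{2\pi\hbar\sinh((t_b - t_a)\omega)}\right]^{1/2} \quad\text{as } n \to \infty. \tag{13.1.23}$$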



That is, by (13.1.13) and Proposition 13.2 we have established

Theorem 13.1. For the simple harmonic oscillator with frequency $\nu = \omega/2\pi$
and potential $V(x) = \omega^2mx^2/2$, the Euclidean path integral $I_E(t_a, t_b, x_a, x_b)$ of
(13.1.1) is given by

$$I_E(t_a, t_b, x_a, x_b) = \exp(-S_E(x_c(t))/\hbar)\left[\frac{m\omega}{2\pi\hbar\sinh\omega(t_b - t_a)}\right]^{1/2} \tag{13.1.24}$$

where the Euclidean action $S_E(x_c(t))$ of the classical path $x_c(t) = A\sinh(\omega t + \phi)$,
$A, \phi \in \mathbb{R}$ is given by

$$S_E(x_c(t)) = \frac{m\omega[(x_a^2 + x_b^2)\cosh\omega(t_b - t_a) - 2x_ax_b]}{2\sinh\omega(t_b - t_a)}; \tag{13.1.25}$$

see (13.1.2).

Theorem 13.1 compares with Theorem 12.1. 
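The limit (13.1.22), (13.1.23) can be watched converging numerically. The Python sketch below is illustrative only (function names and sample values are assumptions): it evaluates the determinant of the tridiagonal matrix in (13.1.16) through the recursion (12.1.38), forms the discretized prefactor $[m/(2\pi\hbar\epsilon_n)]^{1/2}/\sqrt{\det}$ of (13.1.20), and compares it with the limiting value $[m\omega/(2\pi\hbar\sinh\omega(t_b - t_a))]^{1/2}$.

```python
import math


def det_euclidean_matrix(n, eps, omega):
    """det of the (n-1)x(n-1) matrix of (13.1.16), diagonal entries 2 + eps^2 omega^2."""
    two_c = 2.0 + (eps * omega) ** 2
    prev, cur = 1.0, two_c            # 0x0 (assumed convention) and 1x1 determinants
    for _ in range(n - 2):
        prev, cur = cur, two_c * cur - prev
    return cur


def discretized_prefactor(n, m, hbar, omega, T):
    """I_{E,n}(ta, tb, 0, 0) as in (13.1.20), with T = tb - ta and n >= 2."""
    eps = T / n
    return math.sqrt(m / (2.0 * math.pi * hbar * eps * det_euclidean_matrix(n, eps, omega)))


def limit_prefactor(m, hbar, omega, T):
    """Limit value in (13.1.23)."""
    return math.sqrt(m * omega / (2.0 * math.pi * hbar * math.sinh(omega * T)))


if __name__ == "__main__":
    m, hbar, omega, T = 1.0, 1.0, 2.0, 1.0   # arbitrary sample values
    for n in (10, 100, 1000):
        print(n, discretized_prefactor(n, m, hbar, omega, T), limit_prefactor(m, hbar, omega, T))
```

For these sample values the discrepancy should shrink roughly like $1/n^2$ as $n$ grows.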

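For $x, \theta \in \mathbb{R}$, $\sinh x = \sinh 2\left(\frac{x}{2}\right) = 2\sinh\frac{x}{2}\cosh\frac{x}{2} \Longrightarrow \sinh x\tanh\frac{x}{2} =
2\sinh^2\frac{x}{2}$. Also $\cosh 2\theta = \cosh(\theta + \theta) = \cosh^2\theta + \sinh^2\theta = 1 + 2\sinh^2\theta \Longrightarrow
2\sinh^2\frac{x}{2} = \cosh x - 1 \Longrightarrow$

$$\sinh x\tanh\frac{x}{2} = 2\sinh^2\frac{x}{2} = \cosh x - 1. \tag{13.1.26}$$

Thus consider the special case $x_a = x_b = x$. By (13.1.25)

$$S_E(x_c(t)) = m\omega x^2\,\frac{\cosh\omega(t_b - t_a) - 1}{\sinh\omega(t_b - t_a)} = m\omega x^2\tanh\frac{\omega}{2}(t_b - t_a) \geq 0.$$

Corollary 13.1 (to Theorem 13.1).

$$I_E(t_a, t_b, x, x) = \left[\frac{m\omega}{2\pi\hbar\sinh\omega(t_b - t_a)}\right]^{1/2}\exp\left(-\frac{m\omega x^2}{\hbar}\tanh\frac{\omega}{2}(t_b - t_a)\right) \tag{13.1.27}$$

for $x \in \mathbb{R}$.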



13.2 Wave Function Expansion of the 
Euclidean Path Integral 

With a focus still on the example of the simple harmonic oscillator we relate
the Euclidean path integral $I_E(t_a, t_b, x_a, x_b)$ computed in Theorem 13.1 to the
normalized wave functions $\psi_n$ and energy levels $E_n$ of the oscillator. $\psi_n$ and $E_n$
are given by

$$\psi_n(x) = (-1)^n\left(\frac{m\omega}{\pi\hbar}\right)^{1/4}(2^nn!)^{-1/2}\,H_n\left(\sqrt{\frac{m\omega}{\hbar}}\,x\right)\exp\left(-\frac{m\omega x^2}{2\hbar}\right), \tag{13.2.1}$$

$$E_n = \left(n + \frac{1}{2}\right)\omega\hbar \quad\text{for } n = 0, 1, 2, 3, \ldots; \tag{13.2.2}$$

see Section 4.8 of Chapter 4.

$H_n$ is the $n$th Hermite polynomial. The following interesting expansion holds:

Theorem 13.2.

$$I_E(t_a, t_b, x_a, x_b) = \sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp(-E_n(t_b - t_a)/\hbar).$$
n=0 

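The meaning of Theorem 13.2 in regards to quantum statistical mechanics will
be commented on in the next chapter; see remarks following the statement of
Theorem 14.1 there. The key mathematical tool needed to prove Theorem 13.2 is
the Mehler formula

$$\frac{1}{\sqrt{1 - z^2}}\exp\left(\frac{2xyz - (x^2 + y^2)z^2}{1 - z^2}\right) = \sum_{n=0}^{\infty}\frac{H_n(x)H_n(y)}{n!}\left(\frac{z}{2}\right)^n, \tag{13.2.3}$$

or

$$\frac{1}{\sqrt{1 - z^2}}\exp\left(\frac{4xyz - (x^2 + y^2)(1 + z^2)}{2(1 - z^2)}\right) = \exp(-(x^2 + y^2)/2)\sum_{n=0}^{\infty}\frac{H_n(x)H_n(y)}{n!}\left(\frac{z}{2}\right)^n \tag{13.2.4}$$

for $x, y \in \mathbb{R}$, $z \in \mathbb{C}$ with $|z| < 1$.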

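To prove Theorem 13.2 we choose $x = \sqrt{m\omega/\hbar}\,x_a$, $y = \sqrt{m\omega/\hbar}\,x_b$, $z =
\exp(-\omega(t_b - t_a))$. Then for $T := t_b - t_a$,

$$1 \pm z^2 = 1 \pm \exp(-2\omega T) = \frac{\exp(\omega T) \pm \exp(-\omega T)}{\exp(\omega T)} \tag{13.2.5}$$

$$\Longrightarrow\quad 1 - z^2 = \frac{2\sinh\omega T}{\exp(\omega T)}, \quad\text{or}\quad \frac{1}{\sqrt{1 - z^2}} = \frac{\exp(\omega T/2)}{\sqrt{2\sinh\omega T}}. \tag{13.2.6}$$

Also by (13.1.25)

$$\frac{4xyz - (x^2 + y^2)(1 + z^2)}{2(1 - z^2)} = \frac{m\omega}{2\hbar}\left[4x_ax_b\exp(-\omega T) - (x_a^2 + x_b^2)\frac{2\cosh\omega T}{\exp(\omega T)}\right]\Big/\left(\frac{2\sinh\omega T}{\exp(\omega T)}\right)$$

$$= -\frac{m\omega}{2\hbar}\left[\frac{(x_a^2 + x_b^2)\cosh\omega T - 2x_ax_b}{\sinh\omega T}\right] = -S_E(x_c(t))/\hbar. \tag{13.2.7}$$

By (13.2.1), (13.2.4), (13.2.6), and (13.2.7) we see that

$$\frac{\exp(\omega T/2)}{\sqrt{2\sinh\omega T}}\exp(-S_E(x_c(t))/\hbar) = \exp\left(-\frac{m\omega}{2\hbar}(x_a^2 + x_b^2)\right)\sum_{n=0}^{\infty}\frac{H_n\left(\sqrt{\frac{m\omega}{\hbar}}x_a\right)H_n\left(\sqrt{\frac{m\omega}{\hbar}}x_b\right)}{2^nn!}\exp(-n\omega T)$$

$$= \sqrt{\frac{\pi\hbar}{m\omega}}\sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp(-n\omega T) \tag{13.2.8}$$

$$\Longrightarrow\quad \sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp(-n\omega T) = \sqrt{\frac{m\omega}{2\pi\hbar\sinh\omega T}}\,\exp(\omega T/2)\exp(-S_E(x_c(t))/\hbar) \tag{13.2.9}$$

$$\Longrightarrow\quad \sqrt{\frac{m\omega}{2\pi\hbar\sinh\omega T}}\,\exp(-S_E(x_c(t))/\hbar) = \sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp\left(-\left(n + \tfrac{1}{2}\right)\omega T\right) = \sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp(-E_nT/\hbar), \tag{13.2.10}$$

which by Theorem 13.1 proves Theorem 13.2.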
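Theorem 13.2 lends itself to a direct numerical test. The Python sketch below is an illustration under explicit assumptions: units $m = \omega = \hbar = 1$ are chosen, the function names are hypothetical, and the normalized oscillator functions are generated by the standard three-term recurrence equivalent to $H_{n+1}(x) = 2xH_n(x) - 2nH_{n-1}(x)$ (an overall sign $(-1)^n$ in $\psi_n$ would cancel in the products $\psi_n(x_a)\psi_n(x_b)$). It compares a truncated eigenfunction sum with the closed form of Theorem 13.1.

```python
import math


def oscillator_states(x, count):
    """psi_0(x), ..., psi_{count-1}(x) for m = omega = hbar = 1."""
    values = [math.pi ** -0.25 * math.exp(-0.5 * x * x)]
    if count > 1:
        values.append(math.sqrt(2.0) * x * values[0])
    for n in range(1, count - 1):
        # psi_{n+1} = sqrt(2/(n+1)) x psi_n - sqrt(n/(n+1)) psi_{n-1}
        values.append(math.sqrt(2.0 / (n + 1)) * x * values[n]
                      - math.sqrt(n / (n + 1)) * values[n - 1])
    return values


def series_side(xa, xb, T, terms=120):
    """Truncated sum of psi_n(xa) psi_n(xb) exp(-E_n T) with E_n = n + 1/2."""
    pa = oscillator_states(xa, terms)
    pb = oscillator_states(xb, terms)
    return sum(pa[n] * pb[n] * math.exp(-(n + 0.5) * T) for n in range(terms))


def closed_form_side(xa, xb, T):
    """Theorem 13.1 with m = omega = hbar = 1 and T = tb - ta."""
    action = ((xa * xa + xb * xb) * math.cosh(T) - 2.0 * xa * xb) / (2.0 * math.sinh(T))
    return math.sqrt(1.0 / (2.0 * math.pi * math.sinh(T))) * math.exp(-action)


if __name__ == "__main__":
    print(series_side(0.4, -1.1, 0.8), closed_form_side(0.4, -1.1, 0.8))
```

The truncation at 120 terms is ample for $T$ of order one; for very small $T$ the series converges slowly and more terms would be needed.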




14 

The Density Matrix and 
Partition Function in 
Quantum Statistical Mechanics 



14.1 Helmholtz Free Energy, Entropy and Internal
Energy

In quantum statistical mechanics one has the fundamental object $\rho(\beta; x_a, x_b)$, a
matrix entry which defines the density matrix $\rho$, given in the Feynman formulation
by the Euclidean path integral of the preceding chapter:

$$\rho(\beta; x_a, x_b) := \int_{x_a = x(0)}^{x_b = x(\beta\hbar)}\exp(-S_E(x(t))/\hbar)\,D[x(t)] = I_E(0, \beta\hbar, x_a, x_b) \tag{14.1.1}$$

where $\beta$ is the inverse temperature $\frac{1}{T}$ up to a constant: $\beta := \frac{1}{kT}$, $k$ being
Boltzmann's constant. From $\rho$ one obtains the all-important partition function $Z$ as
a trace:

$$Z := \int_{-\infty}^{\infty}\rho(\beta; x, x)\,dx = \int_{-\infty}^{\infty}\int_{x(0)=x}^{x(\beta\hbar)=x}\exp(-S_E(x(t))/\hbar)\,D[x(t)]\,dx. \tag{14.1.2}$$

One usually regards $\rho$ and $Z$ as functions of the temperature $T$ of some system
in thermal equilibrium. Generally speaking the Hamiltonian $H$ of the system has
continuous spectral contributions. In the definition (14.1.2) we assume, strictly
speaking, that $H$ has a purely discrete spectrum, which is the case in the example
of interest to us.
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From Z one derives the basic thermodynamic quantities. For example: 



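$$F = F(T) := -kT\log Z(T), \quad\text{the Helmholtz free energy} \tag{14.1.3}$$

$$Z = \exp(-\beta F), \tag{14.1.4}$$

$$S = S(T) := -\frac{\partial F}{\partial T}, \quad\text{the entropy} \tag{14.1.5}$$

$$U = U(T) := F(T) + TS(T) = F(T) - T\frac{\partial F}{\partial T}, \quad\text{the internal energy.} \tag{14.1.6}$$

Note that $Z = \exp(-\beta F) \Longrightarrow$

$$U = kT^2\,\frac{\partial\log Z}{\partial T}. \tag{14.1.7}$$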



We compute $F$, $S$, and $U$ in the example of continued interest, the harmonic
oscillator. By Theorem 13.1 the density matrix as a path integral is given by

Theorem 14.1.

$$\rho(\beta; x_a, x_b) = \left[\frac{m\omega}{2\pi\hbar\sinh\beta\hbar\omega}\right]^{1/2}\exp(-S_E(x_c(t))/\hbar)$$

for

$$S_E(x_c(t)) = \frac{m\omega}{2}\left[\frac{(x_a^2 + x_b^2)\cosh\beta\hbar\omega - 2x_ax_b}{\sinh\beta\hbar\omega}\right].$$

On the other hand, by Theorem 13.2, we can express the density matrix in
terms of the normalized wave functions $\psi_n$ and energy levels $E_n$ given in (13.2.1),
(13.2.2) before its statement:

$$\rho(\beta; x_a, x_b) = \sum_{n=0}^{\infty}\psi_n(x_a)\psi_n(x_b)\exp(-E_n\beta). \tag{14.1.8}$$



In other words, the path integral definition (14.1.1) of the density matrix coin- 
cides with the more standard version expressed in (14.1.8), as formulated by von 
Neumann and Dirac. This is the meaning of Theorem 13.2 which we promised to 
comment on. 

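By definition (14.1.1) and Corollary 13.1

$$\rho(\beta; x, x) = \left[\frac{m\omega}{2\pi\hbar\sinh\omega\beta\hbar}\right]^{1/2}\exp(-\lambda x^2) \tag{14.1.9}$$

for $\lambda := \dfrac{m\omega}{\hbar}\tanh\dfrac{\omega\beta\hbar}{2}$. Hence, by (14.1.2),

$$Z = \left[\frac{m\omega}{2\pi\hbar\sinh\omega\beta\hbar}\right]^{1/2}\int_{-\infty}^{\infty}\exp(-\lambda x^2)\,dx = \left[\frac{m\omega}{2\pi\hbar\sinh\omega\beta\hbar}\right]^{1/2}\sqrt{\frac{\pi}{\lambda}} = \left[2\sinh\omega\beta\hbar\,\tanh\frac{\omega\beta\hbar}{2}\right]^{-1/2} = \frac{1}{2\sinh\frac{\omega\beta\hbar}{2}}, \tag{14.1.10}$$

by (13.1.26). Using $E_n = \hbar\omega(n + 1/2)$, we note that for $t > 0$

$$\sum_{n=0}^{\infty}\exp(-E_nt) = \exp(-\hbar\omega t/2)\sum_{n=0}^{\infty}(\exp(-\hbar\omega t))^n = \frac{\exp(-\hbar\omega t/2)}{1 - \exp(-\hbar\omega t)} = \frac{1}{\exp(\hbar\omega t/2) - \exp(-\hbar\omega t/2)} = \frac{1}{2\sinh\hbar\omega t/2}. \tag{14.1.11}$$

Hence the following well-known standard result holds: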



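Theorem 14.2.

$$Z = \frac{1}{2\sinh\frac{\beta\hbar\omega}{2}} = \sum_{n=0}^{\infty}\exp(-E_n\beta). \tag{14.1.12}$$

The representation $\sum_{n=0}^{\infty}\exp(-E_n\beta)$ of $Z$ also follows, of course, from (14.1.8)
directly, since the $\psi_n$ are normalized:

$$\int_{-\infty}^{\infty}\psi_n(x)\psi_n(x)\,dx = 1. \tag{14.1.13}$$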
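The three descriptions of the partition function $Z$ (the trace (14.1.2) of the diagonal density (14.1.9), the sum over energy levels, and the closed form $1/(2\sinh(\beta\hbar\omega/2))$) can be compared numerically. The Python sketch below is illustrative; the function names, grid parameters and sample values are assumptions chosen for $\beta\hbar\omega$ of order one.

```python
import math


def diagonal_density(x, beta, m=1.0, omega=1.0, hbar=1.0):
    """rho(beta; x, x) as in (14.1.9)."""
    arg = omega * beta * hbar
    lam = (m * omega / hbar) * math.tanh(0.5 * arg)
    return math.sqrt(m * omega / (2.0 * math.pi * hbar * math.sinh(arg))) * math.exp(-lam * x * x)


def partition_by_trace(beta, m=1.0, omega=1.0, hbar=1.0, half_width=12.0, points=4801):
    """Uniform-grid estimate of the integral of rho(beta; x, x) over the real line."""
    h = 2.0 * half_width / (points - 1)
    return h * sum(diagonal_density(-half_width + j * h, beta, m, omega, hbar)
                   for j in range(points))


def partition_by_levels(beta, omega=1.0, hbar=1.0, levels=400):
    """Truncated sum of exp(-beta E_n) with E_n = hbar omega (n + 1/2)."""
    return sum(math.exp(-beta * hbar * omega * (n + 0.5)) for n in range(levels))


def partition_closed_form(beta, omega=1.0, hbar=1.0):
    """1 / (2 sinh(beta hbar omega / 2))."""
    return 1.0 / (2.0 * math.sinh(0.5 * beta * hbar * omega))


if __name__ == "__main__":
    beta, omega = 0.9, 1.5   # arbitrary sample values
    print(partition_by_trace(beta, omega=omega),
          partition_by_levels(beta, omega=omega),
          partition_closed_form(beta, omega=omega))
```

At much higher temperature (small $\beta$) the Gaussian in $x$ broadens and the level sum converges slowly, so `half_width` and `levels` would need to be increased.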



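Corollary 14.1. The free energy, entropy, and internal energy are given, respectively,
by

$$F(T) = \frac{\hbar\omega}{2} + kT\log(1 - \exp(-\beta\hbar\omega)),$$

$$S(T) = k\left[\frac{\beta\hbar\omega}{\exp(\beta\hbar\omega) - 1} - \log(1 - \exp(-\beta\hbar\omega))\right], \quad\text{and} \tag{14.1.14}$$

$$U(T) = \frac{\hbar\omega}{2} + \frac{\hbar\omega}{\exp(\beta\hbar\omega) - 1}.$$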


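Proof. We use the definitions in (14.1.3)-(14.1.6).

$$F(T) = -kT\log\left[\frac{1}{2\sinh\frac{\beta\hbar\omega}{2}}\right] = -kT\left[-\frac{\hbar\omega\beta}{2} - \log(1 - \exp(-\hbar\omega\beta))\right] \tag{14.1.15}$$

by (14.1.12), which gives the first formula in (14.1.14) since $kT\beta = 1$. By that
first formula

$$\frac{S(T)}{k} = -\frac{1}{k}\frac{\partial F}{\partial T} = T\left[\frac{\exp(-\beta\hbar\omega)}{1 - \exp(-\beta\hbar\omega)}\right]\frac{\hbar\omega}{kT^2} - \log(1 - \exp(-\beta\hbar\omega)) = \frac{\hbar\omega\beta}{\exp(\beta\hbar\omega) - 1} - \log(1 - \exp(-\beta\hbar\omega)), \tag{14.1.16}$$

which gives the second formula in (14.1.14), again as $\beta = \frac{1}{kT}$. Using $U(T) :=
F(T) + TS(T)$ one obtains the third formula of (14.1.14) immediately from
the first two. □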



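By Corollary 14.1 we can understand, for example, the low and high temperature
behavior of the internal energy. For low temperature (i.e., for $\beta$ sufficiently
large) the summand $\frac{\hbar\omega}{\exp(\beta\hbar\omega) - 1}$ in (14.1.14) can be neglected. One obtains right
away that $U(T) \simeq \hbar\omega/2$. To understand the high temperature behavior of $U(T)$,
first recall the definition of the $n$th Bernoulli number $B_n$:

$$\frac{x}{e^x - 1} := \sum_{n=0}^{\infty}\frac{B_n}{n!}\,x^n, \quad\text{where } \frac{x}{e^x - 1} := 1 \text{ for } x = 0. \tag{14.1.17}$$

Thus one has, for example, $B_0 = 1$, $B_1 = -\frac{1}{2}$, $B_2 = \frac{1}{6}$, $B_3 = 0$, $B_4 = -\frac{1}{30}$,
$B_5 = 0$, $B_6 = \frac{1}{42}$, $B_7 = 0$, $B_8 = -\frac{1}{30}$, $B_9 = 0$, $B_{10} = \frac{5}{66}$; $B_n = 0$ for odd $n \geq 3$. By (14.1.14),
(14.1.17),

$$U(T) = \frac{\hbar\omega}{2} + \frac{1}{\beta}\,\frac{\beta\hbar\omega}{\exp(\beta\hbar\omega) - 1} = \frac{\hbar\omega}{2} + \frac{1}{\beta}\sum_{n=0}^{\infty}\frac{B_n(\beta\hbar\omega)^n}{n!}. \tag{14.1.18}$$

For high temperature $T$ (i.e., for $\beta$ sufficiently small) the following first approximation
is immediately obtained:

$$U(T) \simeq \frac{\hbar\omega}{2} + \frac{1}{\beta}\,[B_0 + B_1\beta\hbar\omega] = \frac{1}{\beta} = kT. \tag{14.1.19}$$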
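The formulas of Corollary 14.1 and the two limiting regimes just described can be explored numerically. The Python sketch below is illustrative only: the function names are hypothetical, and units with $k = 1$ and $\hbar\omega = 1$ are assumed by default. It uses `math.expm1` for accuracy when $\beta\hbar\omega$ is small; that call overflows for $\beta\hbar\omega$ above roughly 700, so extremely low temperatures are outside the range of this sketch.

```python
import math


def free_energy(T, hbar_omega=1.0, k=1.0):
    """F(T) = hbar omega / 2 + k T log(1 - exp(-beta hbar omega))."""
    x = hbar_omega / (k * T)
    return 0.5 * hbar_omega + k * T * math.log(-math.expm1(-x))


def entropy(T, hbar_omega=1.0, k=1.0):
    """S(T) = k [x / (exp(x) - 1) - log(1 - exp(-x))], x = beta hbar omega."""
    x = hbar_omega / (k * T)
    return k * (x / math.expm1(x) - math.log(-math.expm1(-x)))


def internal_energy(T, hbar_omega=1.0, k=1.0):
    """U(T) = hbar omega / 2 + hbar omega / (exp(beta hbar omega) - 1)."""
    x = hbar_omega / (k * T)
    return 0.5 * hbar_omega + hbar_omega / math.expm1(x)


if __name__ == "__main__":
    for T in (0.01, 0.7, 1000.0):
        print(T, free_energy(T), entropy(T), internal_energy(T))
```

In these units the printed internal energy should approach $1/2$ at the lowest temperature and $T$ itself at the highest, in line with the two approximations above.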



14.2 A Zeta Function Representation 
of the Free Energy 

The goal of the present section is to construct a zeta function $\zeta(s; T)$ such that
the special value $\zeta'(0; T)$ of its derivative gives the free energy $F(T)$, for the harmonic
oscillator. This provides another example of the occurrence of various zeta
functions in quantum theory. The main result. Theorem 14.3, moreover is of a 
prototypical nature. It serves as an important simple example of the following 
type of situation that occurs often enough in quantum field theory, cosmology, 
and other areas — a more complicated situation that we will encounter in Chap- 
ter 16. Namely, one is given, intuitively, a suitable partition function Z, perhaps 
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with some path integral representation; cf. (14.1.2). The problem is to give some 
precise meaning to $\log Z$ (or to the path integral itself); recall by (14.1.3), for example,
$F(T) := \text{constant}\cdot T\log Z(T)$. That is, one seeks a suitable regularization
of the quantity $\log Z$. One approach to the problem is to construct a suitable zeta
function $\zeta$ such that $\log Z$ may be appropriately defined in terms of the special
values $\zeta'(0)$ and $\zeta(0)$. We consider this approach in Chapter 16 in the study of
finite temperature effects of scalar (or spinor) quantum fields over certain nega- 
tively curved space-times. It must be mentioned however that the quantity log Z 
might have two or more regularizations which are rather non-related. The remarks 
here serve to provide some meaning of, and perspective for Theorem 14.3, whose 
statement we now turn to. 

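Our interest is in the following zeta function.

$$\zeta(s; T) := \sum_{n\in\mathbb{Z}}\frac{1}{(4\pi^2n^2 + \hbar^2\omega^2\beta^2)^s}, \quad \operatorname{Re}s > \frac{1}{2}, \tag{14.2.1}$$

where again $\beta = \frac{1}{kT}$.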



Theorem 14.3. The zeta function $\zeta(s; T)$ of (14.2.1) admits a meromorphic continuation
which in particular is holomorphic at $s = 0$. Moreover one has the
relation

$$F(T) = -\frac{kT}{2}\,\zeta'(0; T) \tag{14.2.2}$$

for the free energy.

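The proof is based on a classical Jacobi inversion formula for the following
theta function defined for $t > 0$:

$$\theta(t) := \sum_{n\in\mathbb{Z}}\exp(-t4\pi^2n^2). \tag{14.2.3}$$

The inversion formula relates the value of $\theta$ at $t$ with its value at the inverse $\frac{1}{t}$ of $t$:

$$\theta(t) = \frac{1}{\sqrt{4\pi t}}\sum_{n\in\mathbb{Z}}\exp(-n^2/4t) = \frac{1}{\sqrt{4\pi t}}\,\theta\left(\frac{1}{16\pi^2t}\right). \tag{14.2.4}$$

For a more general inversion formula, see Appendix 14A. It will be convenient to
work with the slightly modified $\theta$-function $\theta_a$ defined by

$$\theta_a(t) := \sum_{n\in\mathbb{Z}}\exp(-t(4\pi^2n^2 + a)) = \exp(-ta)\,\theta(t) \tag{14.2.5}$$

for $a > 0$, $a$ fixed. By the Jacobi formula (14.2.4)

$$\theta_a(t) = \frac{\exp(-ta)}{\sqrt{4\pi t}}\sum_{n\in\mathbb{Z}}\exp(-n^2/4t) = \frac{\exp(-ta)}{\sqrt{4\pi t}} + 2\sum_{n=1}^{\infty}\frac{\exp(-(at + n^2/4t))}{\sqrt{4\pi t}}. \tag{14.2.6}$$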
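The Jacobi inversion relation between $\sum_{n}\exp(-4\pi^2n^2t)$ and $(4\pi t)^{-1/2}\sum_{n}\exp(-n^2/4t)$ can be confirmed numerically for a range of $t$. The Python sketch below is an illustration; the function names and the cutoff are assumptions. It also shows why the inversion is useful in practice: for small $t$ the first sum converges slowly while the second converges almost immediately, and conversely for large $t$.

```python
import math


def theta_direct(t, cutoff=200):
    """Sum of exp(-4 pi^2 n^2 t) over |n| <= cutoff."""
    return 1.0 + 2.0 * sum(math.exp(-4.0 * math.pi ** 2 * n * n * t)
                           for n in range(1, cutoff + 1))


def theta_inverted(t, cutoff=200):
    """(4 pi t)^(-1/2) times the sum of exp(-n^2 / (4 t)) over |n| <= cutoff."""
    series = 1.0 + 2.0 * sum(math.exp(-n * n / (4.0 * t)) for n in range(1, cutoff + 1))
    return series / math.sqrt(4.0 * math.pi * t)


if __name__ == "__main__":
    for t in (0.001, 0.05, 1.0, 5.0):
        print(t, theta_direct(t), theta_inverted(t))
```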
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Defining 



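$$\zeta_a(s) := \frac{1}{\Gamma(s)}\int_0^{\infty}\theta_a(t)\,t^{s-1}\,dt, \tag{14.2.7}$$

we can commute integration and summation and write

$$\zeta_a(s) = \sum_{n\in\mathbb{Z}}\frac{1}{\Gamma(s)}\int_0^{\infty}\exp(-t(4\pi^2n^2 + a))\,t^{s-1}\,dt = \sum_{n\in\mathbb{Z}}\frac{1}{(4\pi^2n^2 + a)^s} \quad\text{for } \operatorname{Re}s > \frac{1}{2} \tag{14.2.8}$$

since

$$\int_0^{\infty}\exp(-ta)\,t^{s-1}\,dt = a^{-s}\,\Gamma(s) \quad\text{for } \operatorname{Re}s > 0. \tag{14.2.9}$$

We see that the zeta function in (14.2.1) coincides with $\zeta_a$ for the choice $a =
\hbar^2\omega^2\beta^2$. Let us work therefore more generally with the zeta function $\zeta_a$ for an
arbitrary $a > 0$.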

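To meromorphically continue $\zeta_a$ we use the Jacobi inversion formula (14.2.6).
For $\operatorname{Re}s > \frac{1}{2}$

$$\zeta_a(s) = \frac{1}{\Gamma(s)}\int_0^{\infty}\frac{\exp(-ta)}{\sqrt{4\pi t}}\,t^{s-1}\,dt + \frac{2}{\Gamma(s)}\sum_{n=1}^{\infty}\int_0^{\infty}\frac{\exp(-(at + n^2/4t))}{\sqrt{4\pi t}}\,t^{s-1}\,dt$$

$$= \frac{\Gamma(s - 1/2)\,a^{1/2 - s}}{\sqrt{4\pi}\,\Gamma(s)} + \frac{2}{\sqrt{\pi}\,\Gamma(s)}\sum_{n=1}^{\infty}\left(\frac{n}{2\sqrt{a}}\right)^{s - 1/2}K_{-s+1/2}(\sqrt{a}\,n), \tag{14.2.10}$$

where $K_s$ is the $K$-Bessel function which is defined by

$$K_s(x) := \frac{1}{2}\int_0^{\infty}\exp\left(-\frac{x}{2}\left(t + \frac{1}{t}\right)\right)t^{s-1}\,dt \quad\text{for } x > 0,\ s \in \mathbb{C}, \tag{14.2.11}$$

and which satisfies

$$\int_0^{\infty}\frac{\exp(-(at + b^2/4t))}{\sqrt{t}}\,t^{s-1}\,dt = 2\left(\frac{b}{2\sqrt{a}}\right)^{s - 1/2}K_{-s+1/2}(\sqrt{a}\,b) \tag{14.2.12}$$

for $a, b > 0$. On the other hand,

$$K_{-s+1/2}(a) = \frac{(a/2)^{-s+1/2}\sqrt{\pi}}{\Gamma(1 - s)}\int_1^{\infty}\exp(-ax)(x^2 - 1)^{-s}\,dx \tag{14.2.13}$$

for $\operatorname{Re}s < 1$, $a > 0$; see [40, p. 958]. Thus for $\operatorname{Re}s < 1$ we can write the second
term in (14.2.10) as

$$\frac{2}{\sqrt{\pi}\,\Gamma(s)}\sum_{n=1}^{\infty}\left(\frac{n}{2\sqrt{a}}\right)^{s - 1/2}\frac{(\sqrt{a}\,n/2)^{-s+1/2}\sqrt{\pi}}{\Gamma(1 - s)}\int_1^{\infty}\exp(-\sqrt{a}\,nx)(x^2 - 1)^{-s}\,dx$$

$$= \frac{2(\sqrt{a})^{1 - 2s}}{\Gamma(s)\Gamma(1 - s)}\int_1^{\infty}\sum_{n=1}^{\infty}(\exp(-\sqrt{a}\,x))^n(x^2 - 1)^{-s}\,dx = \frac{2}{\pi}(\sqrt{a})^{1 - 2s}\sin\pi s\int_1^{\infty}\frac{\exp(-\sqrt{a}\,x)(x^2 - 1)^{-s}}{1 - \exp(-\sqrt{a}\,x)}\,dx \tag{14.2.14}$$

since $\Gamma(1 - s)\Gamma(s) = \pi(\sin\pi s)^{-1}$. That is,

Theorem 14.4. The meromorphic continuation of $\zeta_a(s)$ to $\operatorname{Re}s < 1$ is given by

$$\zeta_a(s) = \frac{\Gamma(s - 1/2)\,a^{1/2 - s}}{\sqrt{4\pi}\,\Gamma(s)} + \frac{2}{\pi}(\sqrt{a})^{1 - 2s}\sin\pi s\int_1^{\infty}\frac{(x^2 - 1)^{-s}}{\exp(\sqrt{a}\,x) - 1}\,dx. \tag{14.2.15}$$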

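Corollary 14.2. $\zeta_a$ is holomorphic at $s = 0$ and

$$\zeta_a'(0) = -\sqrt{a} - 2\log(1 - \exp(-\sqrt{a})). \tag{14.2.16}$$

Since $1/\Gamma(s)$ and $\sin\pi s$ both vanish at $s = 0$, only the derivatives of these two factors contribute to $\zeta_a'(0)$ in (14.2.15):

$$\zeta_a'(0) = \left.\frac{d}{ds}\frac{1}{\Gamma(s)}\right|_{s=0}\frac{\Gamma(-1/2)\sqrt{a}}{\sqrt{4\pi}} + 2\sqrt{a}\left.\frac{d}{ds}\frac{\sin\pi s}{\pi}\right|_{s=0}\int_1^{\infty}\frac{dx}{\exp(\sqrt{a}\,x) - 1} = -\sqrt{a} + 2\sqrt{a}\int_1^{\infty}\frac{dx}{\exp(\sqrt{a}\,x) - 1} \tag{14.2.17}$$

since $\Gamma(-1/2) = -2\sqrt{\pi}$. By the transformation of variables $t = \exp(\sqrt{a}\,x) - 1$
one has

$$\int_1^{b}\frac{dx}{\exp(\sqrt{a}\,x) - 1} = \frac{1}{\sqrt{a}}\int_{\exp(\sqrt{a}) - 1}^{\exp(\sqrt{a}\,b) - 1}\left[\frac{1}{t} - \frac{1}{t + 1}\right]dt = \frac{1}{\sqrt{a}}\left[\log\left(1 - \frac{1}{\exp(\sqrt{a}\,b)}\right) - \log(1 - \exp(-\sqrt{a}))\right] \to -\frac{1}{\sqrt{a}}\log(1 - \exp(-\sqrt{a}))$$

as $b \to \infty$. That is,

$$\int_1^{\infty}\frac{dx}{\exp(\sqrt{a}\,x) - 1} = -\frac{1}{\sqrt{a}}\log(1 - \exp(-\sqrt{a})),$$

which proves the corollary.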







Having observed that the zeta function $\zeta(s; T)$ in (14.2.1) is given by $\zeta_a(s)$ for
the choice $a = \hbar^2\omega^2\beta^2$ we conclude by Corollary 14.2 that

$$-\frac{kT}{2}\,\zeta'(0; T) = \frac{kT}{2}\,\hbar\omega\beta + kT\log(1 - \exp(-\hbar\omega\beta)) = \frac{\hbar\omega}{2} + kT\log(1 - \exp(-\hbar\omega\beta)) = F(T), \tag{14.2.20}$$

by Corollary 14.1, which proves Theorem 14.3.
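Two ingredients of this argument can be checked numerically: the value of the integral $\int_1^{\infty}dx/(\exp(\sqrt{a}\,x) - 1)$ used in the proof of Corollary 14.2, and the agreement of $-\frac{kT}{2}\zeta'(0; T)$ with $-kT\log Z$. The Python sketch below is illustrative only; the function names, the truncation of the integral at a finite upper limit, and the units $k = \hbar\omega = 1$ are assumptions made here.

```python
import math


def tail_integral(sqrt_a, upper=40.0, steps=20000):
    """Simpson estimate of the integral of 1/(exp(sqrt_a x) - 1) over [1, upper].

    The contribution beyond `upper` is exponentially small for sqrt_a of order one.
    """
    def f(x):
        return 1.0 / math.expm1(sqrt_a * x)

    h = (upper - 1.0) / steps          # steps must be even
    total = f(1.0) + f(upper)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(1.0 + k * h)
    return total * h / 3.0


def zeta_prime_at_zero(sqrt_a):
    """Right-hand side of (14.2.16)."""
    return -sqrt_a - 2.0 * math.log(-math.expm1(-sqrt_a))


def free_energy_from_zeta(T, hbar_omega=1.0, k=1.0):
    """-(k T / 2) zeta'(0; T) with sqrt(a) = hbar omega beta."""
    return -0.5 * k * T * zeta_prime_at_zero(hbar_omega / (k * T))


def free_energy_from_partition(T, hbar_omega=1.0, k=1.0):
    """-k T log Z with Z = 1 / (2 sinh(beta hbar omega / 2))."""
    Z = 1.0 / (2.0 * math.sinh(0.5 * hbar_omega / (k * T)))
    return -k * T * math.log(Z)


if __name__ == "__main__":
    sqrt_a = 1.2
    print(tail_integral(sqrt_a), -math.log(-math.expm1(-sqrt_a)) / sqrt_a)
    print(free_energy_from_zeta(0.7), free_energy_from_partition(0.7))
```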

In Section 14.1 we considered the low and high temperature behavior of the 
internal energy U(T). Similarly, in Section 16.2 of Chapter 16 we shall consider 
the low and high temperature behavior of the free energy F(T), for F(T) defined 
in a more complicated context. 



Appendix 14A Jacobi Inversion 

Formula (14.2.4) is a special case of a more general Jacobi inversion formula. 
Namely for z € C, r > 0 set 

03 iz\r) = (14.A.1) 

ne^ 

Then 

Theorem 14.5 (Jacobi Inversion Formula). 

= (14.A.2) 

Vt j 

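In particular for $z = 0$, $t > 0$

$$\theta_3(0\,|\,4\pi it) = \sum_{n\in\mathbb{Z}}\exp(-4\pi^2n^2t) = \theta(t) \tag{14.A.3}$$

for $\theta(t)$ in (14.2.3). Thus (14.A.2) implies (14.2.4):

$$\theta(t) = \frac{1}{\sqrt{4\pi t}}\,\theta_3\left(0\,\Big|\,\frac{i}{4\pi t}\right) = \frac{1}{\sqrt{4\pi t}}\sum_{n\in\mathbb{Z}}\exp(-n^2/4t). \tag{14.A.4}$$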


For a certain application in Chapter 16 the following version of Jacobi inversion 
is required. 

Theorem 14.6. For r, ^ > 0, /i G C 



n^Tj 



J_ 

V4^ 



--nfip 






J_ 



00 

1+2 ^(cosh/i/^^)c 




n=\ 



(14.A.5) 
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Proof. In (14.A.2) choose t = z = Then ^ and ^ = 

ztl 

Ik ^ 






-litfi 4jct\ _ e "'Va I 

I /lO I r~. ^3 



P J V4^ V 2;r 



(14.A.6) 



If one multiplies both sides of (14.A.6) by one obtains exactly the first equa- 
tion in (14.A.5). The second equation there follows by writing 



n^TL 



= 1 + 2 + 2 

«=i 

= 1 + 2 ^^ [e«^'/» + e-«/'/’] . 



+ 2^e 4, 

n=l 






(14.A.7) 



n=\ 



□ 



In application the $\mu$ in Theorem 14.6 will be chosen to be the chemical potential
of a quantum field over a suitable Kaluza-Klein space-time, and $\beta$ will be chosen as
the inverse temperature of that field.
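The inversion formula in the form used later (compare (16.1.19), (16.1.20)) can be checked numerically. The sketch below is only an illustration under stated assumptions: the spectral side is taken to be the sum over $n \in \mathbb{Z}$ of $\exp(-t(2\pi n/\beta + i\mu)^2)$, and the function names, the truncation `n_max`, and the sample values of $t$, $\beta$, $\mu$ are hypothetical choices, not taken from the text.

```python
import cmath
import math

def theta_spectral(t, beta, mu, n_max=30):
    """Truncated sum over n in Z of exp(-t * (2*pi*n/beta + i*mu)**2).

    The eigenvalue form (2*pi*n/beta + i*mu)**2 is an assumption of this sketch.
    """
    total = 0j
    for n in range(-n_max, n_max + 1):
        lam = (2 * math.pi * n / beta + 1j * mu) ** 2
        total += cmath.exp(-t * lam)
    return total

def theta_inverted(t, beta, mu, n_max=30):
    """beta/sqrt(4*pi*t) * [1 + 2 * sum_{n>=1} cosh(n*beta*mu) * exp(-n^2 beta^2/(4t))]."""
    series = sum(
        math.cosh(n * beta * mu) * math.exp(-((n * beta) ** 2) / (4 * t))
        for n in range(1, n_max + 1)
    )
    return beta / math.sqrt(4 * math.pi * t) * (1 + 2 * series)

# Sample (hypothetical) parameter values.
lhs = theta_spectral(0.3, 1.5, 0.4)
rhs = theta_inverted(0.3, 1.5, 0.4)
print(lhs.real, rhs)
```

Both sides converge very quickly for moderate $t$ and $\beta$; for large $n\beta\mu$ the call to `math.cosh` can overflow, so the truncation should be kept small.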
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Zeta Regularization 



Zeta functions play a very useful role in physics, especially as they provide for 
a natural means of regularizing certain quantities which a priori are infinite, or 
meaningless. In Chapter 14 we presented a zeta function representation of the free 
energy of a harmonic oscillator. In Chapter 16 we shall employ a zeta function, for 
example, to provide for a clear mathematical meaning of a certain partition
function related to the Hawking path integral [45]. The vacuum energy of certain
quantum fields over various space-times is given by a special value of a suitable
zeta function (the value at $s = -1/2$, for example); cf. [4, 11, 12, 14, 88]; also
compare the term in (16.1.47), where the special value $s = -p/2$ determines the
vacuum or Casimir energy of a certain Kaluza-Klein space-time. One uses zeta
functions to define the determinant in certain cases of infinite-dimensional
operators. This is done by a simple procedure called zeta regularization. If
$D = y^2\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)$
is the non-Euclidean Laplace operator on the upper half-plane, for example, then
one can define and compute the determinant of the operator $-D + s(s-1)I$. This
determinant arises in Polyakov string theory [26, 36, 70, 74]. Zeta regularization 
without doubt has proven to be a powerful tool for dealing with many interesting 
phenomena in physics including certain anomalies (more on this in Chapter 17), 
and ambiguities which arise in one-loop or external field approximations in quan- 
tum field theory (so-called ultraviolet divergences), to name a few. Before plung- 
ing into examples that involve more sophisticated zeta functions (such will appear 
in later chapters) we use this chapter to illustrate the regularization procedure in 
a few simple cases. The following example illustrates the basic idea. 

If $n \geq 1$ is an integer, then the factorial $n! = 1 \cdot 2 \cdot 3 \cdots n$ is of course well
defined, whereas $\infty! = 1 \cdot 2 \cdot 3 \cdots n \cdots = \prod_{n=1}^{\infty} n$ is not, since the infinite product







diverges. The question arises whether this infinite product can be regularized. That 
is, can one assign, in a somewhat natural way, a meaning to this infinite product, 
say by looking at it from some alternate point of view, that actually renders it
finite? The answer is yes, and it comes by way of the Riemann zeta function

$$\zeta(s) = \sum_{n=1}^{\infty} \frac{1}{n^{s}}, \quad \operatorname{Re} s > 1, \tag{15.1.1}$$

which we have already encountered in Chapter 3. Formally we have by (15.1.1)



$$\zeta'(s) = -\sum_{m=1}^{\infty} \frac{\log m}{m^{s}} \implies -\zeta'(0) = \sum_{m=1}^{\infty} \log m, \tag{15.1.2}$$

where the latter series does not converge of course. Write, incorrectly,

$$e^{-\zeta'(0)} = e^{\sum_{m=1}^{\infty} \log m} = \prod_{m=1}^{\infty} e^{\log m} = \prod_{m=1}^{\infty} m. \tag{15.1.3}$$

That is, we can now think of the infinite product $\prod_{m=1}^{\infty} m$ (which indeed is non-finite)
as $e^{-\zeta'(0)}$, which is finite since the Riemann zeta function admits a meromorphic
continuation to the full complex plane, which is in fact holomorphic at $s = 0$. In
fact one knows that

$$\zeta'(0) = -\frac{\log 2\pi}{2}. \tag{15.1.4}$$

Thus it is natural to define $\infty!$ as $e^{-\zeta'(0)}$, i.e., $\infty! = \sqrt{2\pi}$. This simple example

of zeta regularization provides the key to the method of regularizing an infinite- 
dimensional determinant. 
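The value $e^{-\zeta'(0)} = \sqrt{2\pi}$ can be confirmed numerically. The following is a minimal sketch, not a definitive implementation: it continues $\zeta$ to a neighborhood of $s = 0$ by Euler-Maclaurin summation (a standard technique, chosen here for illustration and not discussed in the text) and approximates $\zeta'(0)$ by a central difference. The function names, the cutoff `N`, and the step `h` are hypothetical choices.

```python
import math

def zeta(s, N=20):
    """Riemann zeta for real s != 1 via Euler-Maclaurin summation.

    Sums the first N-1 terms directly and approximates the tail with the
    integral term, the half-term and the B_2, B_4, B_6 Bernoulli corrections.
    """
    head = sum(n ** (-s) for n in range(1, N))
    tail = N ** (1 - s) / (s - 1) + 0.5 * N ** (-s)
    tail += s * N ** (-s - 1) / 12
    tail -= s * (s + 1) * (s + 2) * N ** (-s - 3) / 720
    tail += s * (s + 1) * (s + 2) * (s + 3) * (s + 4) * N ** (-s - 5) / 30240
    return head + tail

def zeta_prime(s, h=1e-5):
    """Central-difference approximation to zeta'(s)."""
    return (zeta(s + h) - zeta(s - h)) / (2 * h)

# Regularized value of the divergent product 1 * 2 * 3 * ...
regularized_factorial = math.exp(-zeta_prime(0.0))
print(regularized_factorial, math.sqrt(2 * math.pi))
```

The same `zeta` routine reproduces $\zeta(0) = -1/2$, the special value used later for the harmonic oscillator.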

Namely, suppose we are given a linear operator $T$ on some infinite-dimensional
space. Assume $T$ has a discrete spectrum $\{\lambda_j, n_j\}$ where the multiplicity $n_j$ of
the eigenvalue $\lambda_j$ is finite for each $j$. $T$ could be an elliptic operator on a vector
bundle over a compact manifold, for example. Somehow we should have

$$\det T = \prod_{j=0}^{\infty} \lambda_j^{n_j}. \tag{15.1.5}$$

But again the infinite product generally diverges. To regularize it, consider the
spectral zeta function

$$\zeta_T(s) = \sum_{j=0}^{\infty} \frac{n_j}{\lambda_j^{s}} \quad \text{for } \operatorname{Re} s \gg 0 \tag{15.1.6}$$

of $T$ where we are interested in the case when each $\lambda_j > 0$, for example.
Proceeding formally, as in (15.1.2), (15.1.3) we have (incorrectly)

$$\zeta_T'(s) = -\sum_{j=0}^{\infty} \frac{n_j \log \lambda_j}{\lambda_j^{s}} \implies -\zeta_T'(0) = \sum_{j=0}^{\infty} n_j \log \lambda_j \tag{15.1.7}$$

$$\implies e^{-\zeta_T'(0)} = \prod_{j=0}^{\infty} e^{n_j \log \lambda_j} = \prod_{j=0}^{\infty} \lambda_j^{n_j}. \tag{15.1.8}$$

Thus by comparison with (15.1.3) it is natural to define the determinant of $T$ by

$$\det T \overset{\text{def}}{=} e^{-\zeta_T'(0)} \tag{15.1.9}$$

provided $\zeta_T$ admits a meromorphic continuation at least to a half-plane $\operatorname{Re} s > -\epsilon$
for some $\epsilon > 0$, such that $\zeta_T$ is holomorphic at 0. We see how all of this works out
for the example of the operators

$$T_M = -\frac{d^2}{dx^2} + x^{2M}, \quad M = 1, 2, 3, \ldots \tag{15.1.10}$$

on $L^2(\mathbb{R}, dx)$, where $T_M \psi = E\psi$ is a Schrödinger equation.

The potential energy function $V$ of a harmonic oscillator of mass $m$ and
frequency $\nu$ is given by $V(x) = 2\pi^2 m \nu^2 x^2$, as we have seen several times. More
generally, we considered in Section 3.7 of Chapter 3 potential energy functions
$V$ of the form $V(x) = A x^{2M}$ for $A > 0$, $M = 1, 2, 3, \ldots$. The case $M = 2$ gave
rise to the anharmonic oscillator, which was considered in Chapter 10. The
corresponding Hamiltonian operators (up to scaling) $H = -\frac{d^2}{dx^2} + A x^{2M}$ have a discrete
spectrum $\{\lambda_j(M), n_j\}_{j=0}^{\infty}$ where the multiplicity $n_j$ of the eigenvalue $\lambda_j = \lambda_j(M)$
is 1 for every $j$. By this result of Titchmarsh and the asymptotic behavior of the
$\lambda_j(M)$ as $j \to \infty$, given in Theorem 3.2, we checked in Corollary 3.2 that the
corresponding spectral zeta functions (compare (15.1.6))

$$\zeta_H(s) = \zeta_{H,M}(s) = \sum_{j=0}^{\infty} \frac{n_j}{\lambda_j^{s}} = \sum_{j=0}^{\infty} \frac{1}{\lambda_j^{s}} \tag{15.1.11}$$

converged absolutely for $\operatorname{Re} s > \frac{M+1}{2M}$ and defined holomorphic functions on the
half-planes $\operatorname{Re} s > \frac{M+1}{2M}$; here $0 < \lambda_0 < \lambda_1 < \cdots$, $\lim_{j \to \infty} \lambda_j = \infty$. It is natural to
inquire whether $\zeta_H$ admits a meromorphic continuation to the full complex plane,
and if so whether $\zeta_H$ is holomorphic at $s = 0$.

To illustrate the meromorphic continuation of $\zeta_H$, take $A = 1$, $M = 1$, for
example. Only for the case $M = 1$ are the eigenvalues $\lambda_j$ known: $\lambda_j = 2j + 1$ and
for $\operatorname{Re} s > \frac{M+1}{2M} = 1$ we noted (see (3.7.39)) that

$$\zeta_H(s) = \sum_{n=0}^{\infty} \frac{1}{(2n+1)^{s}} = (1 - 2^{-s})\,\zeta(s) \tag{15.1.12}$$

for $\zeta$ in (15.1.1). Equation (15.1.12) therefore provides for the meromorphic
continuation of $\zeta_H$ to the full complex plane. Since $\zeta$ is holomorphic except for a
simple pole at $s = 1$, with residue 1, $\zeta_H$ has only a simple pole at $s = 1$ with
residue there equal to $\frac{1}{2}$. Also one knows that $\zeta(0) = -\frac{1}{2}$. Hence (for $M = 1$)

Corollary 15.1. $\zeta_H'(0) = -\frac{1}{2} \log 2$.

Now consider $M$ arbitrary again. The zeta functions $\zeta_{H,M}$ in (15.1.11) (defined
for $\operatorname{Re} s > \frac{M+1}{2M}$) still admit a meromorphic continuation to $\mathbb{C}$, with explicit
poles and residues, due to André Voros.

Theorem 15.1 (A. Voros [85]). $\zeta_{H,M}$ is holomorphic at $s = 0$ and in fact
$\zeta_{H,M}'(0) = \log \sin \frac{\pi}{2(M+1)}$; here take $A = 1$.

For $M = 1$, $\log \sin \frac{\pi}{2(M+1)} = \log \frac{1}{\sqrt{2}} = -\frac{1}{2} \log 2$, so Theorem 15.1 indeed
reduces to Corollary 15.1 in case $M = 1$. By definition (15.1.9) it follows that,
for $A = 1$,

Corollary 15.2. $\det T_M = \csc \frac{\pi}{2(M+1)}$.
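For the harmonic oscillator case $M = 1$, the value of $\zeta_H'(0)$ and the resulting determinant can be verified by hand. The short computation below is a sketch that assumes only the factorization $\zeta_H(s) = (1 - 2^{-s})\zeta(s)$ and the special value $\zeta(0) = -1/2$; the factor $1 - 2^{-s}$ vanishes at $s = 0$, so $\zeta'(0)$ does not contribute.

```latex
\begin{aligned}
\zeta_H'(s) &= 2^{-s}(\log 2)\,\zeta(s) + (1 - 2^{-s})\,\zeta'(s), \\
\zeta_H'(0) &= (\log 2)\,\zeta(0) + 0 \cdot \zeta'(0) = -\tfrac{1}{2}\log 2, \\
\det T_1 &= e^{-\zeta_H'(0)} = e^{\frac{1}{2}\log 2} = \sqrt{2} = \csc\frac{\pi}{4}.
\end{aligned}
```

In other words, the divergent product $1 \cdot 3 \cdot 5 \cdots$ of the oscillator eigenvalues is assigned the regularized value $\sqrt{2}$.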

Brief mention was made in the introductory remarks of this chapter about the 
occurrence of the determinant of Laplace type operators in Polyakov string theory. 
We shall have a bit more to say on this matter in the General Appendix C (follow- 
ing Chapter 18), where a concrete computational example will be considered. 
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Helmholtz Free Energy for Certain 
Negatively Curved Space-Times, 
and the Selberg Trace Formula 



16.1 A Zeta Function for Kaluza-Klein Space-Times
Modeled on $SO_1(d,1)/SO(d)$, and a Trace
Formula for Compact Quotients of $SO_1(d,1)$

It is very interesting to consider the computation of the quantum statistical
quantities $F(T)$, $S(T)$, $U(T)$, for example, of Chapter 14, in a topological setting, given
a suitable partition function $Z(T)$, and in particular to understand the dependence
of such quantities on the given topology. We are particularly interested in the case
when the topology is that of a product $\mathbb{R}^p \times M^d$, where $M^d$ is a compact
hyperbolic manifold of dimension $d \geq 2$, which we conveniently express as

$$M^d = \Gamma \backslash X^d \ \text{ for } \Gamma = \pi_1(M^d) = \text{the fundamental group of } M^d \text{ and } X^d = \text{its universal covering manifold.} \tag{16.1.1}$$

$\mathbb{R}^p \times M^d = \mathbb{R} \times (\mathbb{R}^{p-1} \times M^d)$ is called a Kaluza-Klein space-time. Kaluza
extended Einstein's four-dimensional theory of gravity to five dimensions (which
for us corresponds to $p = 1$, $d = 4$) to incorporate Maxwell's theory of
electromagnetism. Although a number of studies have been made on finite-temperature
effects of quantum fields over space-times $\mathbb{R}^p \times N^d$, where the spatial sections
$N^d$ have zero or positive curvature, only recent investigations have focussed on
the case of negatively curved spatial sections $M^d$. The results we present here
therefore are new. They are a special case of more general results obtained by
the author in [91] and presented at the Fourth Alexander Friedmann International
Seminar on Gravitation and Cosmology, held in St. Petersburg, Russia. The key
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mathematical tool needed to obtain these results is the Selberg trace formula, a 
non-commutative version of the Jacobi inversion formula; see Appendix 14A. 

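Although we could take $X^d$ to be complex or quaternionic as well, we will
assume for simplicity that $X^d$ is real. We can express $X^d$ as

$$X^d = \{(x_1, x_2, \ldots, x_d) \in \mathbb{R}^d \mid x_d > 0\}, \tag{16.1.2}$$

which is a manifold with a global chart $(X^d, x = (x_1, \ldots, x_d))$, where $x_i$ is the $i$th
projection function on $X^d$. $X^d$ has a canonical Riemannian structure given by

$$g\!\left(\frac{\partial}{\partial x_i}, \frac{\partial}{\partial x_j}\right) = g_{ij}(x_1, \ldots, x_d) = \delta_{ij}/x_d^2, \tag{16.1.3}$$

$\frac{\partial}{\partial x_i}$ being the (global) $i$th coordinate vector field on $X^d$. The Laplace-Beltrami
operator $\Delta$ corresponding to a metric $g = [g_{ij}]$ is (by definition) given by

$$\Delta : f \mapsto \Delta f = \frac{1}{|\det g|^{1/2}} \sum_{i,j=1}^{d} \frac{\partial}{\partial x_i}\!\left( g^{ij}\, |\det g|^{1/2}\, \frac{\partial f}{\partial x_j} \right), \tag{16.1.4}$$

where $g^{-1} = [g^{ij}]$, $f \in C^{\infty}(X^d)$. Clearly in our case $g^{ij}(x_1, \ldots, x_d) = \delta_{ij}\, x_d^2$
by (16.1.3). Hence by (16.1.4) we immediately obtain

Proposition 16.1.

$$\Delta = x_d^2 \sum_{j=1}^{d} \frac{\partial^2}{\partial x_j^2} - (d-2)\, x_d\, \frac{\partial}{\partial x_d}$$

for the metric given in (16.1.3). In particular for $d = 2$ (in which case $X^2 =
\{(x, y) \in \mathbb{R}^2 \mid y > 0\}$ is simply the upper half-plane, by (16.1.2))

$$\Delta = y^2 \left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} \right).$$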
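The step from (16.1.4) to Proposition 16.1 is short enough to write out. The computation below is a sketch assuming only the metric (16.1.3), for which $|\det g|^{1/2} = x_d^{-d}$ and $g^{ij} = \delta_{ij}\, x_d^2$; only the term with $i = d$ picks up a derivative of the power of $x_d$.

```latex
\begin{aligned}
\Delta f
  &= x_d^{\,d} \sum_{i=1}^{d} \frac{\partial}{\partial x_i}
     \left( x_d^{\,2-d}\, \frac{\partial f}{\partial x_i} \right) \\
  &= x_d^{\,d} \left[ x_d^{\,2-d} \sum_{i=1}^{d} \frac{\partial^2 f}{\partial x_i^2}
     + (2-d)\, x_d^{\,1-d}\, \frac{\partial f}{\partial x_d} \right] \\
  &= x_d^{2} \sum_{i=1}^{d} \frac{\partial^2 f}{\partial x_i^2}
     - (d-2)\, x_d\, \frac{\partial f}{\partial x_d}.
\end{aligned}
```

For $d = 2$ the first-order term drops out, which gives the familiar upper half-plane Laplacian.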



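Besides the concrete representation (16.1.2), $X^d$ has a nice well-known
group-theoretic realization as a homogeneous space:

$$X^d = SO_1(d,1)/SO(d) \tag{16.1.5}$$

where $K = SO(d)$ is the special orthogonal group (see Chapter 7), and where for
$I_n$ = the $n$th order identity matrix and

$$I_{p,q} \overset{\text{def}}{=} \begin{bmatrix} I_p & 0 \\ 0 & -I_q \end{bmatrix}, \tag{16.1.6}$$

$$SO(p,q) \overset{\text{def}}{=} \{ g \in GL(p+q, \mathbb{R}) \mid \det g = 1,\ g^{t} I_{p,q}\, g = I_{p,q} \}, \ \text{with} \tag{16.1.7}$$

$$SO_1(p,q) \overset{\text{def}}{=} \text{the identity component of } SO(p,q). \tag{16.1.8}$$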






In the realization (16.1.5) we regard $\Gamma$ in (16.1.1) as a torsion-free (i.e., only $1 \in \Gamma$
has finite order), discrete subgroup of $G \overset{\text{def}}{=} SO_1(d,1)$ such that $\Gamma \backslash G$ is compact,
since $M^d$ is compact. Since $\Delta$ is $G$-invariant, and thus is $\Gamma$-invariant in particular,
$\Delta$ projects to a differential operator $\Delta_\Gamma$ on the quotient space

$$X_\Gamma \overset{\text{def}}{=} \Gamma \backslash X^d = \Gamma \backslash G / K, \tag{16.1.9}$$

which is a compact negatively curved manifold. For convenience we work with
$-\Delta_\Gamma$ rather than $\Delta_\Gamma$. As an elliptic operator on a compact manifold $X_\Gamma$, $-\Delta_\Gamma$
has a well defined spectrum $\{\lambda_j, n_j\}_{j=0}^{\infty}$ where $n_j$ is the (finite) multiplicity of
the eigenvalue $\lambda_j$ of $-\Delta_\Gamma$. The minus sign preceding $\Delta_\Gamma$ provides for the
non-negativity of the $\lambda_j$, which we may order as follows:

$$0 = \lambda_0 < \lambda_1 < \lambda_2 < \lambda_3 < \cdots, \ \text{with } \lim_{j \to \infty} \lambda_j = \infty. \tag{16.1.10}$$

Note that in the important special case $d = 2$, $X_\Gamma$ in (16.1.9) is the typical
compact Riemann surface (with fundamental group $\Gamma$) of genus $\geq 2$; $SO_1(2,1)$ is
locally isomorphic to $SL(2, \mathbb{R})$, the group of real $2 \times 2$ matrices with determinant
equal to 1.

For a quantum field in thermal equilibrium at a finite temperature $T = 1/(k\beta)$
(we will take $k = 1$) on some curved background space-time, a suitable partition
function $Z$ exists with a path integral representation; cf. [2, 4, 11, 12, 17, 28,
33, 45]. Similar to (14.1.2), where the integration is taken over periodic paths
$x(\beta\hbar) = x(0)$, one obtains $Z$ by integrating over periodic fields $x(\tau) = x(\tau + \beta)$,
where $\tau = it$ is imaginary time; recall the Wick rotation $t \to -it$ which we used
earlier to transform Feynman integrals into Euclidean path integrals. A discussion
of quantum fields is beyond the scope of these notes, of course. The notion of
a measure on a space of fields over some space, moreover, is a bit too fuzzy to
engage our time in discussion here. Fortunately, however, we can achieve a clear
mathematical meaning of the partition function $Z$ by using a suitable zeta function
$\zeta_\Gamma$ attached to the space-times $\mathbb{R}^p \times (\Gamma \backslash X^d)$, which we shall describe presently.
Namely for a suitable constant $c$ we may set

$$\log Z(\beta) = \tfrac{1}{2}\left[ \zeta_\Gamma'(0) + c\, \zeta_\Gamma(0) \right]. \tag{16.1.11}$$

Actually, when $p$ is odd (in particular when $p = 1$, a case of special importance)
it will turn out that $\zeta_\Gamma(0) = 0$. Since our interest is in the case of odd $p$ it will
not be necessary to specify $c$. Of course definition (16.1.11) is quite analogous to
Theorem 14.3 which says precisely that

$$\log Z(\beta) = \tfrac{1}{2}\, \zeta'(0), \tag{16.1.12}$$

for the harmonic oscillator, for the zeta function $\zeta$ given in (14.2.1).

To describe the zeta function $\zeta_\Gamma$ attached to $\mathbb{R}^p \times X_\Gamma = \mathbb{R} \times \mathbb{R}^{p-1} \times
(\Gamma \backslash X^d)$ we employ not only the spectrum $\{\lambda_j, n_j\}_{j=0}^{\infty}$ of the Laplacian $-\Delta_\Gamma$ on
$\Gamma \backslash X^d$ as mentioned earlier, but also the spectrum of the differential operator






def 







( 16 . 1 . 13 ) 



where $\mu$ is a fixed complex number. $\mu$ has a physical meaning: it serves to
represent the chemical potential of a scalar quantum field over $\mathbb{R}^p \times X_\Gamma$. Although we
could take $\mu = 0$ we prefer to carry out our computations, more generally, in the
presence of a non-vanishing chemical potential. Note that the periodic functions



def 



fnit + P) = f„(t) = exp 
are eigenfunctions of D^ \ 



( Inin \ ^ 

(tv’ ' 



def 



D,f„ = XMn)fn iotk„{p,n) = 



Ikh 



1 2 



(16.1.14) 



(16.1.15) 



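We use the eigenvalues $\lambda_j$, $\lambda_n(\beta, \mu)$ to form $\theta$-functions (defined for $t > 0$)

$$\theta_{\beta,\mu}(t) \overset{\text{def}}{=} \sum_{n \in \mathbb{Z}} \exp(-t\, \lambda_n(\beta, \mu)), \tag{16.1.16}$$

$$\Theta_\Gamma(t; b) \overset{\text{def}}{=} \sum_{j=0}^{\infty} n_j \exp(-t(\lambda_j + b)), \tag{16.1.17}$$

where $b > 0$ is fixed. In practice one takes $b = m^2$ where $m$ is the scalar field mass,
which we assume is non-zero; with a little care we could easily treat the case $m = 0$
as well. With $m \neq 0$ there are no zero modes present; i.e., $\lambda_n(\beta, \mu) + \lambda_j + m^2 \neq 0$
for all $n, j$. We come now to the main definition of this chapter. The zeta function
which provides for a clear meaning of the path integral $Z(\beta)$ in (16.1.11) is
defined by the following Mellin transform:

$$\zeta_\Gamma(s) = \zeta_\Gamma(s; b, \beta, \mu) \overset{\text{def}}{=} \frac{1}{(4\pi)^{(p-1)/2}\, \Gamma(s)} \int_0^{\infty} \theta_{\beta,\mu}(t)\, \Theta_\Gamma(t; b)\, t^{\,s - (p-1)/2 - 1}\, dt \tag{16.1.18}$$

for $\operatorname{Re} s > \dim(\mathbb{R}^p \times X_\Gamma)/2 = (p + d)/2$.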

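By the Jacobi inversion formula, Theorem 14.6 of Appendix 14A, we can write
the $\theta$-function $\theta_{\beta,\mu}(t)$ of (16.1.16) as

$$\theta_{\beta,\mu}(t) = \frac{\beta}{\sqrt{4\pi t}} \sum_{n \in \mathbb{Z}} e^{-n\beta\mu} \exp\!\left( -\frac{n^2 \beta^2}{4t} \right) \tag{16.1.19}$$

$$= \frac{\beta}{\sqrt{4\pi t}} \left[ 1 + 2 \sum_{n=1}^{\infty} (\cosh n\beta\mu) \exp\!\left( -\frac{n^2 \beta^2}{4t} \right) \right]. \tag{16.1.20}$$

Therefore

$$\theta_{\beta,\mu}(t)\, \Theta_\Gamma(t; b)\, t^{[s - (p-1)/2] - 1} = \frac{\beta}{\sqrt{4\pi}}\, \Theta_\Gamma(t; b)\, t^{[s - p/2] - 1} + \frac{\beta}{\sqrt{\pi}} \sum_{n=1}^{\infty} (\cosh n\beta\mu) \exp\!\left( -\frac{n^2 \beta^2}{4t} \right) \Theta_\Gamma(t; b)\, t^{[s - p/2] - 1}, \tag{16.1.21}$$

so that if we set

$$\begin{aligned}
\zeta_\Gamma(s; b) &\overset{\text{def}}{=} \frac{1}{\Gamma(s)} \int_0^{\infty} \Theta_\Gamma(t; b)\, t^{s-1}\, dt \quad \text{for } \operatorname{Re} s > \frac{d}{2}, \\
M_{\beta,\mu}(s; b) &\overset{\text{def}}{=} \frac{\beta}{\sqrt{\pi}} \int_0^{\infty} \sum_{n=1}^{\infty} (\cosh n\beta\mu) \exp\!\left( -\frac{n^2 \beta^2}{4t} \right) \Theta_\Gamma(t; b)\, t^{s - 3/2}\, dt \quad \text{for any } s \in \mathbb{C}
\end{aligned} \tag{16.1.22}$$

we have the following.

Proposition 16.2.

$$\zeta_\Gamma(s; b, \beta, \mu) = \frac{\beta\, \Gamma(s - p/2)}{(4\pi)^{p/2}\, \Gamma(s)}\, \zeta_\Gamma(s - p/2; b) + \frac{M_{\beta,\mu}(s - (p-1)/2; b)}{(4\pi)^{(p-1)/2}\, \Gamma(s)}.$$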



Theorem 16.1. $M_{\beta,\mu}(s; b)$ is an entire function of $s$. The zeta function $\zeta_\Gamma(s; b)$
admits an explicit meromorphic continuation in terms of $\Gamma$-structure and in terms of
the spherical harmonic analysis on the symmetric space $G/K = SO_1(d,1)/SO(d)$.
In particular for $d$ even $\zeta_\Gamma$ has at most simple poles at $s = 1, 2, \ldots, d/2$ with known
residues.

Indeed, using the Selberg trace formula, the author has worked out in [90] the
complete, explicit meromorphic structure of $\zeta_\Gamma(s; b)$. Thus Proposition 16.2 and
Theorem 16.1 provide for the meromorphic structure of the finite temperature zeta
function $\zeta_\Gamma(s; b, \beta, \mu)$, which we see is decomposed into an entire function $M_{\beta,\mu}$
translated by $(p-1)/2$, and a simpler zero temperature zeta function $\zeta_\Gamma$ at zero
chemical potential.

An important special case is $p = 1$, $d = 2$, as indicated earlier.

Proposition 16.3. If $p$ is odd and $d$ is even (example $p = 1$, and $d = 2$), then

$$\zeta_\Gamma(s; b, \beta, \mu) = 0 \ \text{ for } s = 0, -1, -2, -3, \ldots.$$

Proof. We use Proposition 16.2. $\frac{1}{\Gamma(s)} = 0$ for $s = 0, -1, -2, -3, \ldots$ and $M_{\beta,\mu}$ is
entire. Thus

$$\frac{M_{\beta,\mu}(s - (p-1)/2; b)}{(4\pi)^{(p-1)/2}\, \Gamma(s)} = 0 \ \text{ for } s = 0, -1, -2, -3, \ldots.$$

If $p$ is odd, then $\Gamma(s - p/2)$ is finite for $s = 0, -1, -2, -3, \ldots$, and similarly
$\zeta_\Gamma(s - p/2; b)$ is finite for the latter values of $s$ (by Theorem 16.1 as $d$ is even).
Consequently

$$\frac{\beta\, \Gamma(s - p/2)}{(4\pi)^{p/2}\, \Gamma(s)}\, \zeta_\Gamma(s - p/2; b) = 0 \ \text{ for } s = 0, -1, -2, -3, \ldots.$$

Proposition 16.3 therefore follows by Proposition 16.2. □

In particular $\zeta_\Gamma(0; b, \beta, \mu) = 0$ by Proposition 16.3, for $p$ odd and $d$ even, in
which case (16.1.11) reduces to

$$\log Z(\beta) = \tfrac{1}{2}\, \zeta_\Gamma'(0; b, \beta, \mu) \ \text{ for } p \text{ odd, } d \text{ even}, \tag{16.1.23}$$

which is the exact analogue of Theorem 14.3.

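Further notation is introduced at this point, notation necessary for the statement
of the trace formula and the main result. Let

$$A = \left\{ \begin{bmatrix} \cosh t & 0 & \sinh t \\ 0 & I_{d-1} & 0 \\ \sinh t & 0 & \cosh t \end{bmatrix} \ \middle|\ t \in \mathbb{R} \right\}, \tag{16.1.24}$$

which is a closed Abelian subgroup of $G$ with Lie algebra

$$\mathfrak{a}_0 = \mathbb{R} H_0 \ \text{ for } H_0 \overset{\text{def}}{=} \begin{bmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{bmatrix}; \tag{16.1.25}$$

recall $I_n$ denotes the $n$th order identity matrix. Let Ad denote as usual the adjoint
representation of $G$ on its Lie algebra $\mathfrak{g}_0$, and its complexified Lie algebra $\mathfrak{g} = \mathfrak{g}_0^{\mathbb{C}}$.
Then for $\gamma \in \Gamma - \{1\}$ we can define $t_\gamma > 0$ by

$$e^{t_\gamma} \overset{\text{def}}{=} \max\{ |c| \mid c = \text{an eigenvalue of } \operatorname{Ad}(\gamma) : \mathfrak{g} \to \mathfrak{g} \}. \tag{16.1.26}$$

Actually $t_\gamma$ arises as follows. Let $M$ denote the centralizer of $A$ in $K$:

$$M = \{ m \in K \mid ma = am \ \ \forall a \in A \} \simeq SO(d-1). \tag{16.1.27}$$

Also let

$$A^{+} = \{ \exp tH_0 \mid t > 0 \} \subset A. \tag{16.1.28}$$

Then since $\Gamma$ is torsion free, each $\gamma \in \Gamma - \{1\}$ is conjugate in $G$ to an element
of $MA^{+}$:

$$\exists\, x \in G \ \text{ such that } \ x \gamma x^{-1} = m_\gamma \exp(t_\gamma H_0) \in MA^{+}. \tag{16.1.29}$$

Moreover the element $m_\gamma \in M$ is unique up to conjugation in $M$. The group $G$
admits an Iwasawa decomposition

$$G = KAN \tag{16.1.30}$$

where $N = \exp \mathfrak{n}_0$ for

$$\mathfrak{n}_0 \overset{\text{def}}{=} \left\{ \begin{bmatrix} 0 & x^{t} & 0 \\ -x & 0 & x \\ 0 & x^{t} & 0 \end{bmatrix} \ \middle|\ x = \text{column vector in } \mathbb{R}^{d-1} \right\}, \tag{16.1.31}$$

a subalgebra of $\mathfrak{g}_0$. Since $m_\gamma$ is unique up to conjugation in $M$ the following
function $C$ on $\Gamma - \{1\}$ is well defined:

$$\frac{1}{C(\gamma)} \overset{\text{def}}{=} \exp\!\left( \frac{d-1}{2}\, t_\gamma \right) \left| \det\nolimits_{\mathfrak{n}_0}\!\left( \operatorname{Ad}(m_\gamma \exp(t_\gamma H_0))^{-1} - 1 \right) \right|. \tag{16.1.32}$$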



Finally, again using that $\Gamma$ is torsion free, we can associate to each $\gamma \in \Gamma - \{1\}$ an
integer $j(\gamma) \geq 1$: $\gamma$ has a unique representation

$$\gamma = \delta^{\,j(\gamma)} \ \text{ where } j(\gamma) \geq 1 \text{ is an integer, } \delta \in \Gamma - \{1\} \text{ is a prime;} \tag{16.1.33}$$

i.e., $\delta$ cannot be written as $\gamma_1^{\,j}$ for $\gamma_1 \in \Gamma$ and $j > 1$ an integer. The definitions
and notation from (16.1.24) to (16.1.33) obviously involve basic structure theory
of the pair $(G, \Gamma)$. Such structure theory is available for any pair $(G, \Gamma)$ where $G$
is a connected, split rank 1, non-compact semi-simple Lie group ($G = SO_1(d,1)$
being an example of such) and where $\Gamma$ is a co-compact, torsion-free, discrete
subgroup of $G$; the reader may consult [89] for details. The latter reference
also contains a proof of the following statement. For a choice of Haar measure $dx$ on
$G$ there is a unique $G$-invariant measure $dx$ on $\Gamma \backslash G$ such that

$$\int_G f(x)\, dx = \int_{\Gamma \backslash G} \sum_{\gamma \in \Gamma} f(\gamma x)\, dx(\Gamma x) \tag{16.1.34}$$

for any continuous, compactly supported test function $f$ on $G$. We set

$$\operatorname{vol}(\Gamma \backslash G) \overset{\text{def}}{=} \int_{\Gamma \backslash G} 1\, dx. \tag{16.1.35}$$

The Selberg trace formula will now be stated for the triple $(SO_1(d,1), SO(d), \Gamma)$.
Actually we give a special version of the formula, one suitable for our
purposes, which results from a special choice of test function. The more general
version, for a general test function, and for a larger class of Lie groups $G$, is discussed
in [89], where proofs are available in detail; also see [38, 48, 64, 77, 86, 90].
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Theorem 16.2 (Selberg Trace Formula for co-compact quotients of SOi (d, 1)). 

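As above let $\Gamma \subset G = SO_1(d,1)$ be a torsion-free, discrete subgroup such that
$\Gamma \backslash G$ is compact. Let

$$\Theta_\Gamma(t; b) = \sum_{j=0}^{\infty} n_j\, e^{-(\lambda_j + b)t}$$

be the $\theta$-function of (16.1.17), where $t, b > 0$. Let $C_\Gamma$ be a complete set of
representatives in $\Gamma$ of its conjugacy classes (here recall that $\gamma_1, \gamma_2$ in $\Gamma$ are conjugate
if $\gamma_1 = \gamma \gamma_2 \gamma^{-1}$ for some $\gamma \in \Gamma$), and form the $\theta$-function

$$\theta_\Gamma(t; b) \overset{\text{def}}{=} \frac{1}{\sqrt{4\pi t}} \sum_{\gamma \in C_\Gamma - \{1\}} t_\gamma\, j(\gamma)^{-1}\, C(\gamma) \exp\!\left[ -\left( b + \left( \frac{d-1}{2} \right)^{2} \right) t - \frac{t_\gamma^{2}}{4t} \right],$$

where $t_\gamma$, $C(\gamma)$, $j(\gamma)$, for $\gamma \in \Gamma - \{1\}$, are given by (16.1.26), (16.1.32), (16.1.33).
Then $\theta_\Gamma(t; b)$ is finite, and one has

$$\Theta_\Gamma(t; b) = \frac{\operatorname{vol}(\Gamma \backslash G)}{4\pi} \int_{\mathbb{R}} \exp\!\left[ -\left( r^{2} + b + \left( \frac{d-1}{2} \right)^{2} \right) t \right] |c(r)|^{-2}\, dr + \theta_\Gamma(t; b), \tag{16.1.36}$$

where for a suitable choice of Haar measure $dx$ on $G$, $\operatorname{vol}(\Gamma \backslash G)$ is given by
(16.1.35), and where $|c(r)|^{-2}\, dr$ is the Harish-Chandra-Plancherel measure
of the symmetric pair $(SO_1(d,1), SO(d))$, given by

$$|c(r)|^{-2} = \begin{cases} C_G\, \pi\, r\, P(r) \tanh \pi r & \text{for } d = 2n \\ C_G\, \pi\, P(r) & \text{for } d = 2n + 1 \end{cases} \tag{16.1.37}$$

with

$$\begin{aligned}
C_G &= \frac{1}{2^{4n-4}\, \Gamma(n)^{2}}, & P(r) &= \prod_{j=0}^{n-2} \left[ r^{2} + \frac{(2j+1)^{2}}{4} \right] && \text{for } d = 2n,\ n \geq 1, \\
C_G &= \frac{1}{2^{4n-2}\, \Gamma(n + 1/2)^{2}}, & P(r) &= \prod_{j=1}^{n} \left[ r^{2} + (n-j)^{2} \right] && \text{for } d = 2n + 1,\ n \geq 1.
\end{aligned} \tag{16.1.38}$$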
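For orientation, the density in (16.1.37), (16.1.38) is easy to tabulate. The following is a small sketch under the assumption that the constants read $C_G = 1/(2^{4n-4}\Gamma(n)^2)$ for $d = 2n$ and $C_G = 1/(2^{4n-2}\Gamma(n+1/2)^2)$ for $d = 2n+1$, with $P(r)$ the stated products; the function name `plancherel_density` is a hypothetical label.

```python
import math

def plancherel_density(r, d):
    """|c(r)|^{-2} for SO_1(d,1)/SO(d), d >= 2, following (16.1.37)-(16.1.38)."""
    if d < 2:
        raise ValueError("d must be an integer >= 2")
    if d % 2 == 0:
        n = d // 2
        c_g = 1.0 / (2 ** (4 * n - 4) * math.gamma(n) ** 2)
        # Empty product (n = 1, i.e. d = 2) is 1.
        poly = math.prod(r * r + (2 * j + 1) ** 2 / 4 for j in range(0, n - 1))
        return c_g * math.pi * r * poly * math.tanh(math.pi * r)
    n = (d - 1) // 2
    c_g = 1.0 / (2 ** (4 * n - 2) * math.gamma(n + 0.5) ** 2)
    poly = math.prod(r * r + (n - j) ** 2 for j in range(1, n + 1))
    return c_g * math.pi * poly

# d = 2 gives pi * r * tanh(pi * r); d = 3 gives r^2.
print(plancherel_density(1.0, 2), plancherel_density(2.0, 3))
```

Under these assumptions the two lowest dimensions reduce to the familiar densities $\pi r \tanh \pi r$ for the hyperbolic plane and $r^2$ for hyperbolic 3-space, which is a useful sanity check on the constants.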



We shall assume $d = 2n$ is even and $p$ is odd. Thus definition (16.1.23) applies.
If one applies the trace formula (16.1.36), one obtains an alternative expression
for $M_{\beta,\mu}(s; b)$ in (16.1.22), which can be made more explicit by the commutation
of integration with certain summations. To justify these commutations one needs
an assumption on the chemical potential $\mu$. We assume

|Re n\ < yfXu -y-. V^. 



(16.1.39) 
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where Ai is the first non-zero eigenvalue of -Ap; see (16.1.10). Then one can 
show that 

M^,^(s-(p-l)/2;b) 

^ /?VOl(r\G) ^_,+l+„;2 



00 

^(cosh 



^-s+p jiinpy/ r'^ + b + pi) |c(r)|' 



n=\ 



yjr^-vh-v pi 



s-pI2 



dr 






2y/b + pi 



s-(p+\)/2 



^(coshnpp) ^ tyjir) ^C{y) 



yeCr-{\} 






s-(p+l)/2 



for p subject to (16.1.36), where 



K. 



5+(p+l 



dei d — \ 

Po = 



)/2i‘^b + pl^rfip'^ + tl) 



(16.1.40) 



(16.1.41) 



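and where the $K$-Bessel function $K_\nu$ is defined in (14.2.11). In particular, by
Proposition 16.2

$$\zeta_\Gamma'(0; b, \beta, \mu) = \frac{\beta}{(4\pi)^{p/2}}\, \Gamma(-p/2)\, \zeta_\Gamma(-p/2; b) + \frac{M_{\beta,\mu}(-(p-1)/2; b)}{(4\pi)^{(p-1)/2}} \tag{16.1.42}$$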

where 

M^,,(-(p-l)/2;6) 



(4;r)(P-l)/2 



;9vol(r\G) 



2^+p!2 



\f7t4n 

oo 

Y,(coshnpP)inP)-‘’^^ 



Kp/ 2 (nP\/r^ + b + pi) |c(r)|“ 



n=l 



^/r^ + b + pi 



-pf2 



dr 



(16.1.43) 



+ 



p[2y/b 



+ Po 



(p+\)/2 



]^(coshn//^) 2 tyjiy) ^C(y) 



n=\ 



\f^‘ 






-(p+l)/2 



rGCr-{l} 

^(p+i)/2('\/ b + pl^ rfifP- + tl). 



By analogy with (14.2.2) we define the Helmholtz free energy $F(T)$ in the
present context, of some quantum field in thermal equilibrium at finite
temperature $T = \frac{1}{\beta}$ over our Kaluza-Klein space-time $\mathbb{R}^p \times (\Gamma \backslash SO_1(d,1)/SO(d))$:

$$F(T) \overset{\text{def}}{=} -\frac{1}{\beta} \log Z(\beta) = -\frac{1}{2\beta}\, \zeta_\Gamma'(0; b, \beta, \mu) \tag{16.1.44}$$
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by (16.1.23), where m = Vbis the mass of the field. Then by (16.1.42), (16.1.43) 
we obtain the following main result. 



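Theorem 16.3. Assume $p$ is odd, $d$ is even, and $\mu$ satisfies (16.1.39). Then the
free energy $F(T)$ in (16.1.44) is given by

$$\begin{aligned}
F(T) = {} & -\frac{\Gamma(-p/2)}{2(4\pi)^{p/2}}\, \zeta_\Gamma(-p/2; m^{2}) \\
& - \frac{\operatorname{vol}(\Gamma \backslash G)\, 2^{p/2}}{\sqrt{\pi}\, (4\pi)^{(p+1)/2}} \sum_{n=1}^{\infty} (\cosh n\beta\mu)\, (n\beta)^{-p/2} \int_{\mathbb{R}} K_{p/2}\!\left( n\beta \sqrt{r^{2} + m^{2} + \rho_0^{2}} \right) \left[ \sqrt{r^{2} + m^{2} + \rho_0^{2}} \right]^{p/2} |c(r)|^{-2}\, dr \\
& - \frac{\left[ 2\sqrt{m^{2} + \rho_0^{2}} \right]^{(p+1)/2}}{2\pi\, (4\pi)^{(p-1)/2}} \sum_{n=1}^{\infty} (\cosh n\beta\mu) \sum_{\gamma \in C_\Gamma - \{1\}} t_\gamma\, j(\gamma)^{-1}\, C(\gamma) \left[ \sqrt{n^{2}\beta^{2} + t_\gamma^{2}} \right]^{-(p+1)/2} K_{(p+1)/2}\!\left( \sqrt{m^{2} + \rho_0^{2}}\, \sqrt{n^{2}\beta^{2} + t_\gamma^{2}} \right),
\end{aligned} \tag{16.1.45}$$

where $\rho_0 = (d-1)/2$, $\zeta_\Gamma$ is the zeta function in (16.1.22), $\operatorname{vol}(\Gamma \backslash G)$, $t_\gamma$, $j(\gamma)^{-1}$,
$C(\gamma)$ are given in definitions (16.1.26) through (16.1.35), $K_\nu$ is the $K$-Bessel
function of (14.2.11), and $|c(r)|^{-2}$ is the spherical Harish-Chandra-Plancherel
density of $SO_1(d,1)/SO(d)$ given by (16.1.37), (16.1.38).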



Some remarks concerning Theorem 16.3 are in order. From definitions (16.1.17),
(16.1.22) one has (commuting integration and summation)

$$\zeta_\Gamma(s; b) = \sum_{j=0}^{\infty} \frac{n_j}{(\lambda_j + b)^{s}} \ \text{ for } \operatorname{Re} s > \frac{d}{2}, \tag{16.1.46}$$

which realizes $\zeta_\Gamma$ as a spectral zeta function of Minakshisundaram-Pleijel type
[65, 85]. The first term

$$-\frac{\Gamma(-p/2)}{2(4\pi)^{p/2}}\, \zeta_\Gamma(-p/2; m^{2}) \tag{16.1.47}$$

in (16.1.45), which is temperature independent, is the Casimir energy of the
space-time. In case $p = 1$, for example, the space-time $\mathbb{R} \times (\Gamma \backslash G/K)$ is called
Clifford-Klein, and a beautiful formula exists which expresses the Casimir energy
in terms of the logarithmic derivative $\psi_\Gamma$ of the Selberg zeta function attached to
$\Gamma \backslash G/K$ [11, 12, 14, 38, 89]. Such a formula is established in [88], for a general
rank 1 space form $\Gamma \backslash G/K$, and is a consequence of the following formula (also
proved in [88]) which provides a direct relationship between $\zeta_\Gamma$ and $\psi_\Gamma$:

$$\zeta_\Gamma(s; b) = \frac{\operatorname{vol}(\Gamma \backslash G)}{4\pi}\, I(s; b) + \frac{\sin \pi s}{\pi} \int_0^{\infty} \psi_\Gamma(\rho_0 + t + a(b))\, (2a(b)t + t^{2})^{-s}\, dt \ \text{ for } \operatorname{Re} s < 1, \tag{16.1.48}$$

where $a(b) = \sqrt{b + \rho_0^{2}}$, and where $I(s; b)$ is explicitly given in terms of the
Plancherel density $|c(r)|^{-2}$. Formula (16.1.48) also holds for $b = 0$, but in this
case the summation in (16.1.46) begins with $j = 1$, and we require $\operatorname{Re} s < 0$.
We can relate the Casimir energy with the logarithmic derivative $\psi_\Gamma$ of Selberg's
zeta function for any odd $p$, more generally. Of course formula (16.1.48) is of
independent interest.

Consider the second and third terms in ( 1 6. 1 .45) which represent the non-trivial 
(temperature dependent) contribution to the free energy. The sum of the latter two 
terms corresponds to QT^{P,p) in [91], where the superscript indicates that 
the field is bosonic, whereas the superscript in [91] is reserved for fermionic 
fields. By [40, p. 967] 



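$$K_{q+1/2}(x) = \sqrt{\frac{\pi}{2x}}\; e^{-x} \sum_{k=0}^{q} \frac{(q+k)!}{k!\,(q-k)!\,(2x)^{k}}. \tag{16.1.49}$$

Write $p = 2q + 1$ so that $p/2 = q + 1/2$. The choice $x = n\beta\sqrt{r^{2} + m^{2} + \rho_0^{2}}$ then
allows us to express the second term in (16.1.45) as

$$-\frac{\operatorname{vol}(\Gamma \backslash G)\, 2^{q}}{(4\pi)^{(p+1)/2}\, \beta^{(p+1)/2}} \sum_{n=1}^{\infty} (\cosh n\beta\mu)\, n^{-(p+1)/2} \sum_{k=0}^{q} \frac{(q+k)!}{k!\,(q-k)!} \int_{\mathbb{R}} (r^{2} + m^{2} + \rho_0^{2})^{(q-k)/2} \exp\!\left( -n\beta \sqrt{r^{2} + m^{2} + \rho_0^{2}} \right) (2n\beta)^{-k}\, |c(r)|^{-2}\, dr, \tag{16.1.50}$$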



which with the third term in (16.1.45) provides for the content of Theorem 3.1 
of [91]. For the bosonic case there where r~ is taken = 1, vol(M^) = vol(r\G)/4;r, 
and F(T) is referred to as the one-loop effective potential p) or thermody- 
namic potential. 



16.2 Temperature Asymptotics 

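In Section 14.1 of Chapter 14 we obtained the low and high temperature behavior
of $U(T)$. Similarly by Theorem 16.3 we can obtain a low and high temperature
asymptotic expansion of $F(T)$. For example the following expansion is good for
large $x$:

$$K_\nu(x) = \sqrt{\frac{\pi}{2x}}\; e^{-x} \left[ \sum_{k=0}^{l-1} \frac{\Gamma(\nu + k + 1/2)}{(2x)^{k}\, k!\, \Gamma(\nu - k + 1/2)} + \theta\, \frac{\Gamma(\nu + l + 1/2)}{(2x)^{l}\, l!\, \Gamma(\nu - l + 1/2)} \right], \tag{16.2.1}$$

say for $\nu \in \mathbb{R}$, $x > 0$, $l > \nu - 1/2$ ($l = 1, 2, 3, \ldots$), where $|\theta| < 1$; see [40, p. 963].
To obtain an expression for $F(T)$ good for low temperatures $T$ (i.e., for large
$\beta$) we take $x = \sqrt{m^{2} + \rho_0^{2}}\, \sqrt{n^{2}\beta^{2} + t_\gamma^{2}}$, $\nu = (p+1)/2$.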
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Theorem 16.4. Using (16.2.1), we can write the third term in (16.1.45) as 

]p/^ 



+ pi 



I W(rt-‘C(rt 

^ ^ «=1 r€Cr-{l) 

exp + pl^J n'^ tj^ 

1 F(^ + ^ + 3/2) 



t=o {2yjm'^ + pl^Jn^p^ + k\ 



~2.k.S^q-k + 2/2) 



+e- 



r(^ + / + 3/2) 



2^m^ plyJn^P^ + t 



1/ 



/!F(^-/ + 3/2) 



for any integer $l > p/2$; here $|\theta| < 1$.

Using the expression (16.1.50) for the second term in (16.1.45) and [40,
formula 8.446, p. 961] for the behavior of $K_{q+1}(x)$ for small $x > 0$ ($q = 0, 1, 2, \ldots$),
we can rewrite the third term in (16.1.45) and obtain similarly high temperature
asymptotics for $F(T)$, which with Theorems 16.3, 16.4 extend the results of [10]
(which focussed on the important special case $p = 1$) to any odd dimension $p$ of
the Euclidean factor of the space-time $\mathbb{R}^p \times \Gamma \backslash X^d$.

The results of this chapter, including the trace formula, can be extended to a
vector bundle setting, namely to an automorphic vector bundle over $\Gamma \backslash X^d$
induced by a finite-dimensional unitary representation $\chi$ of $\Gamma$. For simplicity,
we have taken $\chi$ to be the one-dimensional trivial representation in this chapter.
We will take up another application of the trace formula in the next chapter.
For further applications of this important formula (and of the Selberg zeta
function) in quantum mechanics and in quantum field theory the reader may
consult [1, 11, 12, 16, 37, 52].

The idea of using the trace formula to find a fairly explicit meromorphic contin- 
uation of the spectral zeta function in Theorem 16.1 goes back to Burton Randol 
[72], in the special (but important) case when d = 2. The extension of Randol’s 
idea to an arbitrary split rank one setting is carried out in [90]. Spectral zeta
functions are defined for any compact manifold ($\Gamma \backslash G/K$ being a special case), and
their meromorphic continuation is carried out in general, but not explicitly, by 
S. Minakshisundaram and A. Pleijel [65]. 
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The Zeta Function of a Product 
of Laplace Operators and the 
Multiplicative Anomaly for 



17.1 $\det L_1 L_2 = \det L_1 \det L_2$?

In Chapter 15 we defined the determinant $\det L$, in certain situations, of an
operator $L$ on an infinite-dimensional space by way of zeta regularization. Given two
operators $L_1, L_2$ it is natural to inquire whether the familiar formula

$$\det L_1 L_2 = \det L_1 \det L_2 \tag{17.1.1}$$

remains valid. We shall see in general that it does not. The question of the validity
of (17.1.1) is one not only of pure mathematical interest but is of practical interest
as well, as it arises in various field theories. Namely, in certain settings [13, 18],
it is important to compute the multiplicative anomaly $a(L_1, L_2)$ defined by

$$a(L_1, L_2) \overset{\text{def}}{=} \log \det L_1 L_2 - \log \det L_1 - \log \det L_2; \tag{17.1.2}$$

$$\text{i.e., } \det L_1 L_2 = \exp(a(L_1, L_2)) \det L_1 \det L_2. \tag{17.1.3}$$

We see that (17.1.1) is valid if and only if the anomaly $a(L_1, L_2)$ vanishes.

Of particular interest is the computation of $a(L_1, L_2)$ when $L_1, L_2$ are Laplace
type operators. Taking advantage of the notation set up in Chapter 16 we
specifically choose

$$L_r = -\Delta_\Gamma + b_r, \quad r = 1, 2, \tag{17.1.4}$$

where $b_1, b_2 > 0$ ($b_1, b_2$ may represent some field masses), and where $\Delta_\Gamma$ is the
Laplace-Beltrami operator on $X_\Gamma = \Gamma \backslash SO_1(d,1)/SO(d)$; see Proposition 16.1
and remarks following it. The computation of $a(L_1, L_2)$ for such $L_r$ is non-trivial.
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The key mathematical tool which we rely on is, again, the Selberg trace formula — 
formula (16.1.36) of the previous chapter. An attempt to provide a full array of 
details, as we have done in other discussions, will not be undertaken here. We will 
highlight some of the key points involved in the computation, again making free 
use of the notation of Chapter 16. The starting point is the study of the spectral 
zeta function of the product of Laplace operators L 1 L 2 . 

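As $\{\lambda_j, n_j\}_{j=0}^{\infty}$ denotes the spectrum of $-\Delta_\Gamma$, the spectrum of $L_r \overset{\text{def}}{=} -\Delta_\Gamma + b_r$,
$1 \leq r \leq 2$, is $\{\lambda_j + b_r, n_j\}_{j=0}^{\infty}$, and the corresponding spectral zeta function of $L_r$
(compare (15.1.6)) is

$$\zeta_{L_r}(s) = \sum_{j=0}^{\infty} \frac{n_j}{(\lambda_j + b_r)^{s}} \ \text{ for } \operatorname{Re} s > \frac{d}{2}. \tag{17.1.5}$$

That is, by (16.1.46)

$$\zeta_{L_r}(s) = \zeta_\Gamma(s; b_r), \tag{17.1.6}$$

and by definition (15.1.9)

$$\det L_r = \exp(-\zeta_\Gamma'(0; b_r)). \tag{17.1.7}$$

The spectrum of $L_1 L_2$ is $\{(\lambda_j + b_1)(\lambda_j + b_2), n_j\}_{j=0}^{\infty}$ (which is consistent with
finite-dimensional linear algebra), and the corresponding spectral zeta function of
the product of Laplacians $L_1 L_2$ is given by

$$\zeta_\Gamma(s; b_1, b_2) \overset{\text{def}}{=} \sum_{j=0}^{\infty} \frac{n_j}{[(\lambda_j + b_1)(\lambda_j + b_2)]^{s}} \ \text{ for } \operatorname{Re} s \text{ sufficiently large.} \tag{17.1.8}$$

Definition (17.1.8) is the key definition for the present chapter; see
Proposition 17.1 below. If $b_1 = b_2$, then $\zeta_\Gamma(s; b_1, b_2) = \zeta_\Gamma(2s; b_1)$, whose meromorphic
structure is explicitly known [90], as pointed out in Theorem 16.1. Therefore we
always assume $b_1 \neq b_2$, say $b_1 > b_2$.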

Proposition 17.1. Cr(s\b\,b 2 ) is a well-defined holomorphic function on 
Re s > f. 

The problem before us is to show that $\zeta_\Gamma(s; b_1, b_2)$ admits a meromorphic
continuation to $\mathbb{C}$ which in particular is holomorphic at $s = 0$. For then by
definition (15.1.9) we would set

$$\det L_1 L_2 = \exp\left(-\zeta_\Gamma'(0; b_1, b_2)\right), \tag{17.1.9}$$

and by (17.1.2), (17.1.7), (17.1.9) the goal would be to compute

$$a(L_1, L_2) = -\zeta_\Gamma'(0; b_1, b_2) + \zeta_\Gamma'(0; b_1) + \zeta_\Gamma'(0; b_2). \tag{17.1.10}$$

We would like to have a very explicit description of the meromorphic
continuation of $\zeta_\Gamma(s; b_1, b_2)$, say in terms of $\Gamma$-structure and in terms of the spherical
harmonic analysis for the symmetric space $G/K = SO_1(d, 1)/SO(d)$, as we have
for the zeta function $\zeta_\Gamma(s; b)$, $b > 0$. We can achieve such a description, as we
now indicate, by use of the trace formula.
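The remark that the product spectrum is "consistent with finite-dimensional linear algebra" can be illustrated with a small numerical sketch. It replaces the infinite spectrum by a hypothetical finite list of eigenvalues and multiplicities (the values in `SPECTRUM` are made up for illustration), so every zeta function is an entire finite sum and the right-hand side of (17.1.10) should vanish up to rounding. A nonzero anomaly can only arise from the analytic continuation needed in the infinite-dimensional case.

```python
import math

# Hypothetical finite "spectrum" {(lambda_j, n_j)}; not taken from the text.
SPECTRUM = [(0.0, 1), (1.0, 2), (2.5, 3), (4.0, 1)]


def zeta_single(s, b):
    """Finite analogue of (17.1.5): sum of n_j / (lambda_j + b)^s."""
    return sum(n * (lam + b) ** (-s) for lam, n in SPECTRUM)


def zeta_product(s, b1, b2):
    """Finite analogue of (17.1.8): sum of n_j / [(lambda_j + b1)(lambda_j + b2)]^s."""
    return sum(n * ((lam + b1) * (lam + b2)) ** (-s) for lam, n in SPECTRUM)


def derivative_at_zero(f, h=1e-5):
    """Central-difference approximation of f'(0)."""
    return (f(h) - f(-h)) / (2.0 * h)


def zeta_det(b):
    """Zeta-regularized determinant exp(-zeta'(0)), as in (17.1.7)."""
    return math.exp(-derivative_at_zero(lambda s: zeta_single(s, b)))


def anomaly(b1, b2):
    """Right-hand side of (17.1.10) for the finite toy spectrum."""
    d12 = derivative_at_zero(lambda s: zeta_product(s, b1, b2))
    d1 = derivative_at_zero(lambda s: zeta_single(s, b1))
    d2 = derivative_at_zero(lambda s: zeta_single(s, b2))
    return -d12 + d1 + d2
```

In this finite setting `zeta_det(b)` agrees with the ordinary product of the shifted eigenvalues counted with multiplicity, and `anomaly(b1, b2)` is zero to within the finite-difference error.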



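17.2 Explicit Meromorphic Structure of $\zeta_\Gamma(s; b_1, b_2)$

Define

$$b_+ \stackrel{\mathrm{def}}{=} \frac{b_1 + b_2}{2}, \quad b_- \stackrel{\mathrm{def}}{=} \frac{b_1 - b_2}{2} \implies b_1 = b_+ + b_-, \ b_2 = b_+ - b_- \text{ with } b_- > 0; \tag{17.2.1}$$

i.e., $b_1 > b_2$ by assumption.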



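As the $K$-Bessel functions $K_\nu$ have been vital for Chapters 14, 16, the modified
Bessel functions $I_\nu$ are vital for the present chapter:

$$I_\nu(t) \stackrel{\mathrm{def}}{=} \left(\frac{t}{2}\right)^{\nu} \sum_{m=0}^{\infty} \frac{(t/2)^{2m}}{\Gamma(\nu + m + 1)\, m!} \quad \text{for } t > 0,\ \nu \in \mathbb{C}; \tag{17.2.2}$$

the series in (17.2.2) converges absolutely, by the ratio test. We are interested in
the following Laplace-Mellin transform of $I_\nu$:

$$I(s; \lambda, b_1, b_2) \stackrel{\mathrm{def}}{=} \int_0^\infty e^{-(\lambda + b_+)t}\, I_{s-1/2}(b_- t)\, t^{s-1/2}\, dt \quad \text{for } \operatorname{Re} s > 0,\ \lambda \ge 0. \tag{17.2.3}$$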



We shall see that $I(s; \lambda, b_1, b_2)$ is well defined. Using some standard asymptotics
of $I_\nu(t)$ for large and small $t > 0$ [34] one sees that

Proposition 17.2. The integral

$$\int_0^\infty e^{-at} I_\nu(t)\, t^{\mu}\, dt$$

converges absolutely for $a > 1$, $\operatorname{Re}(\mu + \nu) > -1$.

By the transformation $t \to t/b_-$

$$I(s; \lambda, b_1, b_2) = b_-^{-s-1/2} \int_0^\infty e^{-at} I_{s-1/2}(t)\, t^{s-1/2}\, dt \tag{17.2.4}$$

for $a = (\lambda + b_+)/b_-$, where by (17.2.1) $b_+ - b_- = b_2 > 0 \implies b_+ > b_- \implies b_+/b_- > 1
\implies a > 1$ for $\lambda \ge 0$. That is, by Proposition 17.2, $I(s; \lambda, b_1, b_2)$ is indeed well
defined for $\lambda \ge 0$, $\operatorname{Re} s > 0$.

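Theorem 17.1. For $b_1 > b_2 > 0$, $\operatorname{Re} s > 0$, $\lambda \ge 0$

$$I(s; \lambda, b_1, b_2) = (2b_-)^{s-1/2}\, \frac{\Gamma(s)}{\sqrt{\pi}}\, \frac{1}{(\lambda + b_1)^s (\lambda + b_2)^s},$$

where $b_- = (b_1 - b_2)/2$.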
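Theorem 17.1 is critical for our computation of $a(L_1, L_2)$; we indicate its proof.
By [40, formula 5, p. 713]

$$\int_0^\infty e^{-at} I_\nu(t)\, t^{\mu}\, dt = \Gamma(\nu + \mu + 1)\,(a^2 - 1)^{-(\mu+1)/2}\, P_\mu^{-\nu}\!\left(\frac{a}{\sqrt{a^2 - 1}}\right) \tag{17.2.5}$$

for $\operatorname{Re}(\nu + \mu) > -1$, where $P_\mu^{-\nu}$ is an associated Legendre function of the first
kind. Concerning $P_\mu^{-\nu}$ we shall only need the special values formula

$$P_\nu^{-\nu}(\cosh t) = \frac{1}{\Gamma(\nu + 1)} \left(\frac{\sinh t}{2}\right)^{\nu} \tag{17.2.6}$$

given on [40, p. 1008]. For $a = (\lambda + b_+)/b_- > 1$ in (17.2.4), choose $\beta = a/\sqrt{a^2 - 1}$.
Then

$$\beta > 1 \implies \beta = \cosh t \text{ for some } t > 0 \implies \sinh^2 t = \beta^2 - 1 = \frac{1}{a^2 - 1} \tag{17.2.7}$$

$$\implies P_{s-1/2}^{-(s-1/2)}(\beta) = \frac{1}{\Gamma(s + 1/2)} \left(\frac{1}{2\sqrt{a^2 - 1}}\right)^{s-1/2} \quad \text{for } \operatorname{Re} s > 0 \tag{17.2.8}$$

by (17.2.6). Since $\beta \sqrt{a^2 - 1} = a$ (by (17.2.7)) we have by (17.2.4),
(17.2.5), (17.2.8)

$$I(s; \lambda, b_1, b_2) = \frac{b_-^{-s-1/2}\, \Gamma(2s)\, (a^2 - 1)^{-(s+1/2)/2}}{\Gamma(s + 1/2)} \left(\frac{1}{2\sqrt{a^2 - 1}}\right)^{s-1/2}. \tag{17.2.9}$$

Now

$$a^2 - 1 = \frac{(\lambda + b_+)^2 - b_-^2}{b_-^2} = \frac{(\lambda + b_2)(\lambda + b_1)}{b_-^2} \tag{17.2.10}$$

by (17.2.1). Also by the Legendre duplication formula

$$\frac{\Gamma(2s)}{\Gamma(s + 1/2)} = \frac{2^{2s-1}}{\sqrt{\pi}}\, \Gamma(s). \tag{17.2.11}$$

This with (17.2.10) plugged into (17.2.9) gives the desired formula for
$I(s; \lambda, b_1, b_2)$. □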
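As a sanity check on Theorem 17.1, the following sketch compares a direct numerical evaluation of the Laplace-Mellin integral with the closed form $(2b_-)^{s-1/2}\,\Gamma(s)\,\pi^{-1/2}\,[(\lambda+b_1)(\lambda+b_2)]^{-s}$. It is only an illustration under stated assumptions: $s$ is taken real, the Bessel function is summed from its power series, the integral is truncated at a finite upper limit, and the sample parameter values are arbitrary. The function names are hypothetical.

```python
import math


def bessel_i(nu, t, max_terms=500):
    """Modified Bessel function I_nu(t), t >= 0, summed from its power series."""
    half = t / 2.0
    term = half ** nu / math.gamma(nu + 1.0)   # m = 0 term
    total = term
    for m in range(max_terms):
        # ratio of consecutive series terms
        term *= half * half / ((m + 1.0) * (nu + m + 1.0))
        total += term
        if term <= 1e-17 * total:
            break
    return total


def laplace_mellin(s, lam, b1, b2, upper=30.0, steps=6000):
    """Composite Simpson approximation of the integral defining I(s; lam, b1, b2)."""
    b_plus, b_minus = (b1 + b2) / 2.0, (b1 - b2) / 2.0

    def integrand(t):
        return (math.exp(-(lam + b_plus) * t)
                * bessel_i(s - 0.5, b_minus * t) * t ** (s - 0.5))

    h = upper / steps
    total = integrand(0.0) + integrand(upper)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    return total * h / 3.0


def closed_form(s, lam, b1, b2):
    """Right-hand side of Theorem 17.1."""
    b_minus = (b1 - b2) / 2.0
    return ((2.0 * b_minus) ** (s - 0.5) * math.gamma(s) / math.sqrt(math.pi)
            / ((lam + b1) ** s * (lam + b2) ** s))
```

Simpson's rule is accurate here only when the integrand is smooth at $t = 0$, which is why the check is best run at values such as $s = 1$ or $s = 3/2$, where $t^{s-1/2} I_{s-1/2}(b_- t)$ is analytic at the origin.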
For the $\theta$-function $\Theta_\Gamma(t; b)$ in (16.1.17) with the choice $b = b_+ = (b_1 + b_2)/2 > 0$ we
consider, similarly, the transform

$$I_\Gamma(s; b_1, b_2) \stackrel{\mathrm{def}}{=} \int_0^\infty \Theta_\Gamma(t; b_+)\, I_{s-1/2}(b_- t)\, t^{s-1/2}\, dt. \tag{17.2.12}$$

We state without proof the following.



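Theorem 17.2. The integral in (17.2.12) converges absolutely for $\operatorname{Re} s > d/4$ and
one may commute the integration in (17.2.12) with the summation in $\Theta_\Gamma$.

Thus by Theorem 17.2 and definition (17.2.3) one has for $\operatorname{Re} s > d/4$

$$I_\Gamma(s; b_1, b_2) = \sum_{j=0}^{\infty} n_j \int_0^\infty \exp\left(-t(\lambda_j + b_+)\right) I_{s-1/2}(b_- t)\, t^{s-1/2}\, dt = \sum_{j=0}^{\infty} n_j\, I(s; \lambda_j, b_1, b_2). \tag{17.2.13}$$

That is, by Theorem 17.1 and definitions (17.1.8), (17.2.12)

$$\frac{(2b_-)^{s-1/2}\, \Gamma(s)}{\sqrt{\pi}}\, \zeta_\Gamma(s; b_1, b_2) = \int_0^\infty \Theta_\Gamma(t; b_+)\, I_{s-1/2}(b_- t)\, t^{s-1/2}\, dt \quad \text{for } \operatorname{Re} s > \frac{d}{4}, \tag{17.2.14}$$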



where we recall that $\zeta_\Gamma(s; b_1, b_2)$ is holomorphic in $s$ on $\operatorname{Re} s > d/4$ by
Proposition 17.1. By the way, equation (17.2.14) is the analogue of equation (16.1.22).
The main point about equation (17.2.14) (as with equation (16.1.22)) is that
one is placed exactly in position to apply the trace formula. Namely, proceeding
as in [90], one essentially obtains the explicit meromorphic continuation of
$\zeta_\Gamma(s; b_1, b_2)$ to $\mathbb{C}$ by plugging into (17.2.14) the right-hand side of (16.1.36) and
applying Fubini's theorem. Because of the factor $I_{s-1/2}(b_- t)$ in (17.2.14), the
arguments now are a bit more difficult than those in [90], but on the other hand they
are also more interesting.

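To state the main result we introduce further notation. For $a \in \mathbb{C}$, $n \ge 0$ an
integer, the corresponding Pochhammer symbol $(a)_n$ is defined by

$$(a)_n = a(a+1)(a+2)\cdots(a+n-1) = \frac{\Gamma(a+n)}{\Gamma(a)}; \quad (a)_0 = 1. \tag{17.2.15}$$

Then the Gauss hypergeometric function $F(a, b; c; z)$ is defined by

$$F(a, b; c; z) \stackrel{\mathrm{def}}{=} \sum_{n=0}^{\infty} \frac{(a)_n (b)_n}{(c)_n}\, \frac{z^n}{n!} \quad \text{for } |z| < 1. \tag{17.2.16}$$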
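The two definitions above are easy to evaluate directly. The sketch below is a plain truncated-series implementation, not an optimized one: `pochhammer` multiplies out the rising factorial, and `hyp2f1` sums the hypergeometric series using the ratio of consecutive terms, which avoids overflow in the factorials. The function names and the truncation length are illustrative choices, and real arguments with $|z| < 1$ are assumed.

```python
import math


def pochhammer(a, n):
    """Rising factorial (a)_n = a (a+1) ... (a+n-1), with (a)_0 = 1."""
    result = 1.0
    for k in range(n):
        result *= a + k
    return result


def hyp2f1(a, b, c, z, terms=300):
    """Truncated Gauss hypergeometric series F(a, b; c; z) for |z| < 1."""
    if abs(z) >= 1.0:
        raise ValueError("the series is only used here for |z| < 1")
    term = 1.0          # n = 0 term
    total = 1.0
    for n in range(terms):
        # term_{n+1} / term_n = (a+n)(b+n) z / ((c+n)(n+1))
        term *= (a + n) * (b + n) * z / ((c + n) * (n + 1.0))
        total += term
    return total
```

A convenient check is the classical special case $F(1, 1; 2; z) = -\ln(1 - z)/z$, together with $(3)_4 = 3 \cdot 4 \cdot 5 \cdot 6 = \Gamma(7)/\Gamma(3)$.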



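For $j \ge 0$ an integer and $B_1, B_2 > 0$ set

$$E_j(s; B_1, B_2) \stackrel{\mathrm{def}}{=} 2 \int_0^\infty \frac{r^{2j+1}\, dr}{(r^2 + B_1)^s (r^2 + B_2)^s (1 + \exp(2\pi r))} \quad \text{for } s \in \mathbb{C}. \tag{17.2.17}$$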
For the case of $d = 2n$ even let

$$|c(r)|^{-2} = C_G\, \pi r P(r) \tanh \pi r \tag{17.2.18}$$

be the Harish-Chandra-Plancherel density for $G/K = SO_1(d, 1)/SO(d)$ as
before, where $C_G$, $P(r)$ are given by (16.1.38). Write



[2^(2; + ir 






(17.2.19) 



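For the $\theta$-function $\theta_\Gamma(t; b)$ of Theorem 16.2 with the choice $b = b_+$, set

$$E_\Gamma(s; b_1, b_2) \stackrel{\mathrm{def}}{=} \frac{\sqrt{\pi}\,(2b_-)^{1/2 - s}}{\Gamma(s)} \int_0^\infty \theta_\Gamma(t; b_+)\, I_{s-1/2}(b_- t)\, t^{s-1/2}\, dt \quad \text{for } s \in \mathbb{C}. \tag{17.2.20}$$

Finally, set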






-1-^/ for 7 = 1,2; then 



0 < — B 2 = — Z?2 = 2b- 



B 7 — 2 



B] — Bo 



+ b\ + b2 — 2 



B1+B2 6 + + (^) 



(17.2.21) 

(17.2.22) 



(17.2.23) 



(17.2.24) 



The form of the meromorphic continuation of $\zeta_\Gamma(s; b_1, b_2)$ is determined by the
parity of $d$. We will state the main theorem only in the case when $d = 2n$ is
even; this is the more difficult and more interesting case. Again we always assume
$b_1 > b_2 > 0$.

Theorem 17.3 (The meromorphic continuation of the zeta function $\zeta_\Gamma$ in
(17.1.8) of a product of Laplacians). Suppose the dimension $d$ of the symmetric
space G/K = SO\(d, 1)/ SO(d) is even: d = 2n. Then for Re s > ^ 

;ris\bi,b2) = 

4)t 



«-l ^2jJ 

1- 






(2s - l)(2s - 2) • • • (2s - j)(2s ~U + 1)) 



vol(F\G) 



^ oijEjis', Bi,B 2) + Er(s', b\, (> 2 ), 



(17.2.25) 
where, for a suitable normalization of Haar measure on $G$, $\operatorname{vol}(\Gamma \backslash G)$ is given
by (16.1.35) and Co = (2^"”^ T{n)^)~^. The coefficients a 2 j are determined 
by the spherical Harish-Chandra-Plancherel measure $|c(r)|^{-2}\, dr$ of $G/K$; see
(17.2.18), (17.2.19). $B_1, B_2$ are given by (17.2.21), $F$ is the hypergeometric
function of (17.2.16), and $E_j$, $E_\Gamma$ are entire functions given by (17.2.17), (17.2.20).

Since s ^ F{a, b\ s\ c)/T{s) is an entire function, for a,b,c e C fixed, \c\ < 
1, we multiply the first term in (17.2.25) by F(s + l/2)/r(5 + 1/2) and con- 
clude that the poles of Cr(*5; ^ 2 ) are simple and are located at the points 5 = 

“I, - 5 , . . . , 5 = . . . , In particular ^ 1 , ^ 2 ) is holomorphic 

at $s = 0$, and the determinant $\det L_1 L_2$ is well defined in (17.1.9). Using
Theorem 17.3 and results in [90], we can compute $\zeta_\Gamma'(0; b_1, b_2)$ and $\zeta_\Gamma'(0; b_j)$, $j = 1, 2$,
and thus the multiplicative anomaly $a(L_1, L_2)$ in (17.1.10). After some tedious
calculations one arrives at the following theorem.

Theorem 17.4. Suppose $d = 2n$ is even, and $C_G$, $\operatorname{vol}(\Gamma \backslash G)$, $B_j$, $a_{2j}$ are as in
Theorem 17.3. Then



a(LuL2) 



Gg 



voi(r\G)l 



n-l 

= I 

j=0 

j 

p=3 






(p+l)p\ {j-pV- 



I I ^ I 

-—-1— 



p p- 



(17.2.26) 



In particular $a(L_1, L_2) = 0$ for $d = 2$ (and thus equation (17.1.1) is valid), but
for $d = 4$

$$a(L_1, L_2) = \frac{C_G \operatorname{vol}(\Gamma \backslash G)}{16}\, a_2\, (b_1 - b_2)^2 \ne 0. \tag{17.2.27}$$

If $d$ is odd, $a(L_1, L_2) = 0$.

We see that an explicit expression is obtained for the anomaly $a(L_1, L_2)$ in the
special case of a hyperbolic space form $X_\Gamma$, where $L_1, L_2$ are Laplace type
operators. In general $a(L_1, L_2)$, for a class of invertible, elliptic, pseudo-differential
operators $L_1, L_2$, can be expressed in terms of Wodzicki's non-commutative residue
[92, 93]; also compare [13, 18, 29, 59]. One knows that (consistent with
Theorem 17.4) one always has $a(L_1, L_2) = 0$ for odd dimensional manifolds [59]. The
importance of the Wodzicki residue for physics has come to light recently. This
residue has been considered, for example, in the approach via non-commutative
geometry to the standard model of electroweak interactions [20, 21, 54, 55].
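Schrödinger's Equation
and Gauge Theory

Patrick Shanahan



18.1 A Brief Introduction to Gauge Theory

Our starting point is the Schrödinger equation for the wave function $\Psi$ of a charged
particle in an electromagnetic field $(E, B)$ defined on space-time $\mathbb{R}^4$:

$$i\hbar\, \frac{\partial \Psi}{\partial t} = \frac{1}{2m}\left(-i\hbar \nabla - qA\right)^2 \Psi + q\phi \Psi,$$

or equivalently,

$$i\hbar \left(\partial_t + \frac{iq}{\hbar}\phi\right)\Psi = -\frac{\hbar^2}{2m} \sum_{j=1}^{3} \left(\partial_j - \frac{iq}{\hbar}A_j\right)^2 \Psi. \tag{18.1.1}$$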

Here $q$ is the charge and $A$ and $\phi$ are vector and scalar potentials for the field.
Thus, $\operatorname{curl} A = B$ and $-\operatorname{grad} \phi = E + \partial A/\partial t$. That such potentials exist is a
consequence of Maxwell's equations for the given electromagnetic field.

The potentials $A$ and $\phi$ are far from unique. If $(A, \phi)$ is a potential pair for the
field $(E, B)$, then so is $(A', \phi')$, where

$$A' = A + \operatorname{grad} f \quad \text{and} \quad \phi' = \phi - \frac{\partial f}{\partial t}$$

for some function $f = f(x, t)$, and conversely. This ambiguity is the beginning of
the subject known as gauge theory.
Suppose that in (18.1.1) we replace $A$ and $\phi$ by the equally valid potentials $A'$
and $\phi'$. With the aid of the integrating factor $e^{iqf/\hbar}$ one may show that if $\Psi$ is
a solution of (18.1.1), then $\Psi' = e^{iqf/\hbar}\Psi$ is a solution of the new equation. Of
course, since $|e^{iqf/\hbar}| = 1$ for each $(x, t)$, the probability $|\Psi(x, t)|^2$ of finding
the particle at point $x$ and at time $t$ will remain unchanged if $\Psi$ is replaced by $\Psi'$.
Nevertheless, the fact that the complex numbers $\Psi(x, t)$ and $\Psi'(x, t)$ have different
phases (i.e., arguments) cannot be disregarded. For example, if $\Psi_1$ and $\Psi_2$ are
solutions to a particular Schrödinger equation, then so is the superposition $\Psi =
\Psi_1 + \Psi_2$, and to compute $|\Psi(x, t)|^2$, it is necessary to know the phases of $\Psi_1(x, t)$
and $\Psi_2(x, t)$ (or at least the difference between these phases) as well as their
moduli.
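The role of the relative phase in a superposition can be seen in a two-line computation. The sketch below treats the values of two wave functions at a single point as complex numbers (the sample amplitudes and angles are arbitrary): a common phase factor leaves the squared modulus of the sum unchanged, while a relative phase of $\pi$ turns constructive interference into complete cancellation.

```python
import cmath
import math


def density(psi1, psi2):
    """|psi1 + psi2|^2 for the superposition of two complex amplitudes."""
    return abs(psi1 + psi2) ** 2


def rotate(psi, angle):
    """Multiply an amplitude by the phase factor e^{i * angle}."""
    return cmath.exp(1j * angle) * psi


psi1, psi2 = 1.0 + 0.0j, 1.0 + 0.0j

# Same phase change applied to both amplitudes: the density is unchanged.
common_phase = density(rotate(psi1, 0.7), rotate(psi2, 0.7))

# Phase change applied to only one amplitude: the density changes.
relative_phase = density(psi1, rotate(psi2, math.pi))
```

Both amplitudes have modulus 1 throughout, so the moduli alone cannot distinguish the two cases.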

The question we will address here is the following: how can one deal efficiently
with the “relativity of phase” of the solutions to the Schrödinger equation for a
charged particle which stems from the ambiguity in the choice of the potentials $A$
and $\phi$?

The situation is a bit like the one that one encounters when working with vec- 
tors. In order to carry out computations involving vectors one must choose a coor- 
dinate system, i.e., a basis. When a different basis is chosen, the components of a 
given vector are replaced by new ones. Thus the components are relative entities 
which depend on a choice of basis, while the invariant object they represent is the 
vector itself. 

There is another question, closely related to the problem of ambiguity of phase.
For each choice $(A, \phi)$ of potentials for a given electromagnetic field we obtain a
different Schrödinger equation, although the physical situation remains the same.
To put this in terms of differential operators, for the potentials $A$ and $\phi$, the
relevant partial differential operators are

$$\frac{\partial}{\partial x_i} - \frac{iq}{\hbar} A_i \quad \text{and} \quad \frac{\partial}{\partial t} + \frac{iq}{\hbar}\phi,$$

where $A_i$, $i = 1, 2, 3$, is the $i$th component of the vector potential $A$. (Modified
operators such as these are an example of covariant derivative operators, which
we will have more to say about later.) If one chooses instead the potentials $A'$ and
$\phi'$ for the given electromagnetic field, the relevant operators are

$$\frac{\partial}{\partial x_i} - \frac{iq}{\hbar} A_i - \frac{iq}{\hbar}\frac{\partial f}{\partial x_i} \quad \text{and} \quad \frac{\partial}{\partial t} + \frac{iq}{\hbar}\phi - \frac{iq}{\hbar}\frac{\partial f}{\partial t},$$

where $A' = A + \operatorname{grad} f$ and $\phi' = \phi - \partial f/\partial t$. In other words, the ambiguity in
the choice of vector and scalar potentials leads to a sort of “relativity of covariant
derivatives.”
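The two sets of operators are related by the same phase factor that relates the wave functions. The short computation below is a sketch, assuming the operators have the form $\partial_i - (iq/\hbar)A_i$ and $\partial_t + (iq/\hbar)\phi$ and that $A' = A + \operatorname{grad} f$, $\phi' = \phi - \partial f/\partial t$. It shows that the primed operators applied to $e^{iqf/\hbar}\Psi$ give $e^{iqf/\hbar}$ times the unprimed operators applied to $\Psi$, which is why $e^{iqf/\hbar}\Psi$ solves the new equation whenever $\Psi$ solves the old one.

```latex
\begin{align*}
\Bigl(\partial_i - \tfrac{iq}{\hbar}A'_i\Bigr)\bigl(e^{iqf/\hbar}\Psi\bigr)
  &= e^{iqf/\hbar}\Bigl(\partial_i\Psi + \tfrac{iq}{\hbar}(\partial_i f)\Psi
     - \tfrac{iq}{\hbar}A_i\Psi - \tfrac{iq}{\hbar}(\partial_i f)\Psi\Bigr)
   = e^{iqf/\hbar}\Bigl(\partial_i - \tfrac{iq}{\hbar}A_i\Bigr)\Psi, \\
\Bigl(\partial_t + \tfrac{iq}{\hbar}\phi'\Bigr)\bigl(e^{iqf/\hbar}\Psi\bigr)
  &= e^{iqf/\hbar}\Bigl(\partial_t\Psi + \tfrac{iq}{\hbar}(\partial_t f)\Psi
     + \tfrac{iq}{\hbar}\phi\Psi - \tfrac{iq}{\hbar}(\partial_t f)\Psi\Bigr)
   = e^{iqf/\hbar}\Bigl(\partial_t + \tfrac{iq}{\hbar}\phi\Bigr)\Psi.
\end{align*}
```

In each line the term produced by differentiating the phase factor cancels against the extra term in the primed potential.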

In the remainder of this chapter we show that this can be put into a unifying
geometric context, one that has far-reaching consequences for the understanding
not only of the electromagnetic force, but also of nuclear forces, such as the weak
force and the strong force, and ultimately even the force which is most familiar to
us in everyday life, the gravitational force. In order to do this it is necessary to discuss
(briefly) the concepts of vector bundles, sections of a bundle, covariant derivatives
of sections, and finally, the curvature associated to a given covariant derivative.

18.2 

We briefly recall the concept of a fiber bundle. Let $E$ and $X$ be topological spaces,
and let $\pi : E \to X$ be a mapping of $E$ onto $X$. Suppose that for each $x \in X$ the
inverse image $\pi^{-1}(x)$ is homeomorphic to a fixed space $F$. (See Figure 18.1.)




Figure 18.1: Basic set-up for a fiber bundle 



The simplest example of such a system is the product space $X \times F$, where
$\pi = p_1$ is the standard projection of $X \times F$ onto $X$. In general, however, $E$ will
not be homeomorphic to a product of two spaces.

Let us now assume that $E$ is locally equivalent to a product, in the sense that for
each $x$ in $X$, there is a neighborhood $U$ of $x$ and a homeomorphism $h : \pi^{-1}(U) \to
U \times F$ which is compatible with the projections $\pi : \pi^{-1}(U) \to U$ and $p_1 : U \times F \to
U$. That is, we require for each $e \in \pi^{-1}(U)$ that

$$p_1(h(e)) = \pi(e).$$

In this situation we say that $E$ is a fiber bundle over $X$ with fiber $F$. Intuitively, a
fiber bundle is a sort of “twisted product.” The set $X$ is called the base space of
the bundle, $E$ is called the total space of the bundle, and $\pi$ is called the projection.
The space $E_x = \pi^{-1}(x)$ is called the fiber over $x$.

A section $s$ of a fiber bundle is a mapping $s : X \to E$ such that $\pi(s(x)) = x$ for
each $x \in X$.

A simple example of a fiber bundle that is not a product bundle is the Möbius
band. The collection of all tangent
vectors to a smooth $n$-dimensional manifold $M$ is a fiber bundle, with base space
$M$ and fiber $\mathbb{R}^n$; this bundle is called the tangent bundle of $M$, denoted $TM$. A
section of $TM$ is simply a field of tangent vectors on $M$.

A fiber bundle $E$ is said to be trivial, or equivalent to a product bundle, if there
exists a homeomorphism $h : E \to X \times F$ that is compatible with the projections
$\pi$ and $p_1$.

Let $E$ be a fiber bundle and let $h_i : \pi^{-1}(U_i) \to U_i \times F$ be a local trivialization.
For each $x \in U_i$, let $\phi_i(x) : E_x \to F$ be the restriction of $h_i$ to $E_x$, followed by
the projection $p_2$ of $U_i \times F$ onto $F$. If $h_j : \pi^{-1}(U_j) \to U_j \times F$ is another local
trivialization, then for each $x \in U_i \cap U_j$ the composite $g_{ji}(x) = \phi_j(x) \circ \phi_i(x)^{-1}$ is
a homeomorphism

$$g_{ji}(x) : F \to F.$$

(See Figure 18.2.) 



F 




Figure 18.2: Transition mappings 

We will refer to the mappings $g_{ji}(x)$ as “transition mappings” for the fiber $E_x$.
It follows immediately from the definition of the transition mappings that for each
$i$ and each $x \in U_i$, $g_{ii}(x)$ is the identity mapping of $F$, and that if $x \in U_i \cap U_j \cap U_k$
then $g_{ki} = g_{kj} \circ g_{ji}$.

As an example, consider the Moebius band, regarded as a fiber bundle whose
base is the circle and whose fiber is the interval $[-1, 1]$. For a trivializing
covering we may take any two open arcs $U_1$ and $U_2$ which cover the circle. The local
trivializations $h_1$ and $h_2$ may then be chosen in such a way that the transition
functions $g_{21}(x)$ preserve or reverse the orientation of the fiber depending on which
connected component of $U_1 \cap U_2$ the point $x$ belongs to.
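The Moebius band example can be made concrete with a small sketch. It assumes one particular choice of covering, which is not taken from the text: $U_1$ is the arc of angles $(-\pi/4, 5\pi/4)$ and $U_2$ is the arc $(3\pi/4, 9\pi/4)$, so the overlap has two components, one around $\theta = \pi$ and one around $\theta = 0$. The transition function acts on the fiber coordinate $y \in [-1, 1]$ by multiplication by $+1$ on the first component and by $-1$ on the second.

```python
import math


def g21(theta):
    """Transition function from chart 1 to chart 2 at base angle theta."""
    theta = theta % (2.0 * math.pi)
    if 0.75 * math.pi < theta < 1.25 * math.pi:
        return 1.0       # component around theta = pi: orientation preserved
    if theta > 1.75 * math.pi or theta < 0.25 * math.pi:
        return -1.0      # component around theta = 0: orientation reversed
    raise ValueError("theta is not in the overlap of the two arcs")


def g12(theta):
    """Inverse transition function, from chart 2 back to chart 1."""
    return 1.0 / g21(theta)


def once_around(y):
    """Carry a fiber coordinate once around the circle through both charts."""
    y_in_chart_2 = g21(math.pi) * y      # pass from U1 to U2 near theta = pi
    return g12(0.0) * y_in_chart_2       # pass from U2 back to U1 near theta = 0
```

Going once around the circle sends $y$ to $-y$, which is the twist that prevents the bundle from being a product; going around twice returns the original value.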

Let $E$ be a fiber bundle whose fiber $F$ is an $n$-dimensional vector space. If
each of the fibers $E_x$ is a vector space, such that the mappings $\phi_i(x) : E_x \to F$
are linear, then $E$ is called a vector bundle of dimension $n$. When the fiber $F$ is
$\mathbb{R}^n$ (resp., $\mathbb{C}^n$), the bundle $E$ is called a real (resp., complex) vector bundle. For
example, the tangent bundle to a smooth manifold is a real vector bundle, whose
transition mappings are given by the derivatives of the change-of-coordinate
mappings associated to local coordinate charts.

Let $E$ be an $n$-dimensional vector bundle, and let $h : \pi^{-1}(U) \to U \times F$ be
a local trivialization. To each basis for the vector space $F$, there corresponds
via $h$ a family $\sigma$ of sections $\sigma_1, \ldots, \sigma_n$ such that for each $x \in U$, the elements
$\sigma_1(x), \ldots, \sigma_n(x)$ are a basis for $E_x$. Thus every section $s$ defined over $U$ may be
expressed in the form

$$s = \sum_{i=1}^{n} a_i \sigma_i,$$

where the $a_i$ are scalar-valued functions defined on $U$. The family $\sigma$ is called a
local basis over $U$.

If $\tau$ is another local basis over $U$, then for each $x$ there exist scalars $c_{ij}(x)$
such that

$$\tau_j(x) = \sum_{i=1}^{n} c_{ij}(x)\, \sigma_i(x)$$

for $j = 1, \ldots, n$. We will refer to the matrix-valued function $C = (c_{ij})$ as the
transition matrix from the local basis $\sigma$ to the local basis $\tau$.

Since our intention is to replace the wave functions $\Psi$ of quantum mechanics
by sections of a complex vector bundle over space-time $\mathbb{R}^4$, we will restrict
our attention in what follows mainly to vector bundles whose fiber is complex
$n$-dimensional space $\mathbb{C}^n$. In addition to the requirement that each fiber $E_x$ be a
complex vector space, we will further assume that each fiber $E_x$ is supplied with
a hermitian inner product $\langle\ ,\ \rangle_x$, in such a way that the linear transformations
$\phi_i(x) : E_x \to \mathbb{C}^n$ are hermitian with respect to the standard inner product on
$\mathbb{C}^n$ defined by $\langle z, w \rangle = \sum_{i=1}^{n} z_i \bar{w}_i$. In particular, this will imply that the modulus
$|s(x)|$ of a section $s$ at a point $x \in X$ is a well-defined real number; this will prove
to be important in connection with the interpretation of the modulus squared of a
quantum wave function at a point of space-time in terms of probabilities.

When $E$ is a hermitian vector bundle we will restrict our attention to orthonormal
local bases $\sigma = \{\sigma_1, \ldots, \sigma_n\}$, i.e., to local bases over $U$ which satisfy

$$\langle \sigma_i(x), \sigma_j(x) \rangle_x = \delta_{ij}$$

for each $x \in U$. If $\tau$ is another such local basis, then the transition matrices $C(x)$
from $\sigma$ to $\tau$ will be unitary matrices for each $x \in U$.

At this point it will be useful to make the relation between sections $s$ of vector
bundles and quantum wave functions $\Psi$ more explicit. Let $E$ be a hermitian vector
bundle of dimension 1 over space-time $\mathbb{R}^4$. Changing our notation slightly, we will
denote a point in the base space by $(x, t)$. It is known that every vector bundle
over $\mathbb{R}^n$ (or over any space which is contractible to a point, for that matter) is
trivial, and hence we may consider local bases $\sigma = \{\sigma_1\}$ defined over the whole
of $\mathbb{R}^4$. If $\sigma$ is such a local basis, with $|\sigma_1| = 1$, and if $s$ is a section of $E$, then
$s = \Psi^\sigma \sigma_1$, where $\Psi^\sigma$ is a complex-valued function on $\mathbb{R}^4$. Similarly, if $\tau$ is another
such local basis, then $s = \Psi^\tau \tau_1$ for some function $\Psi^\tau$. For each $(x, t)$, the transition
matrix $C(x, t)$ from $\sigma_1(x, t)$ to $\tau_1(x, t)$ is simply a complex number $c(x, t)$ of
modulus 1. From

$$\Psi^\sigma \sigma_1 = s = \Psi^\tau \tau_1 = \Psi^\tau c\, \sigma_1$$

we see that the relation between the two representatives of $s$ is

$$\Psi^\tau = c^{-1} \Psi^\sigma.$$

Moreover, since $|c^{-1}(x, t)| = 1$ for each $(x, t) \in \mathbb{R}^4$, it follows from the fact
that $\mathbb{R}^4$ is simply connected that $c^{-1} = e^{if}$ for some real-valued function $f =
f(x, t)$. Thus (except for the factor $q/\hbar$, which could be included by modifying
$f$ slightly) we have recaptured the relation $\Psi' = e^{iqf/\hbar}\Psi$ of gauge equivalence of
wave functions that was introduced in the first section.

Of course, there is no connection so far with the Schrödinger equation or with
electromagnetic forces. In order to achieve such a connection we must first discuss 
the concept of covariant derivative operators on vector bundles. 

18.3 

A fiber bundle is smooth if the total space, the base space and the fiber are smooth
manifolds, and the projection $\pi$ and the local coordinate mappings $h_i : \pi^{-1}(U_i) \to
U_i \times F$ are smooth (i.e., $C^\infty$).

Let $E$ be a smooth real or complex vector bundle over $\mathbb{R}^4$. In order to write
down a “Schrödinger equation” in which the unknown is a section $s$ of $E$, we need
some kind of partial derivative operators which can act on sections. Somewhat
more generally, we would like to be able to speak of “the derivative of a section
$s$ with respect to a vector $v \in \mathbb{R}^4$.” One might be tempted to try to define such an
operator just as one does for ordinary vector-valued functions, i.e., as the limit of
a Newton quotient

$$\frac{s(x + hv) - s(x)}{h}$$

as $h$ tends to zero. However, there is a problem: the numerator of this expression
is not well defined, since $s(x)$ and $s(x + hv)$ lie in different fibers. If we had some
sort of connection between the fibers $E_x$ and $E_{x+hv}$, at least for $h$ sufficiently
small, then we could use this connection to make the numerator a well-defined
element of $E_x$. Letting $h$ go to zero, we would then have obtained an element
(traditionally denoted $(\nabla_v s)(x)$) of $E_x$. This intuitive idea of a “connection”
relating nearby (actually infinitesimally close) fibers is in fact one way of defining
the so-called “covariant derivative operators” acting on sections of $E$. In fact, by
abuse of language, covariant derivative operators are often referred to simply as
“connections.”

In practice, it is more convenient to approach the problem differently, by listing 
the properties which a first-order derivative operator on a smooth vector bundle 
ought to possess, and then using these properties to determine the form of all such 
operators. 

A covariant derivative operator on a smooth vector bundle $E$ is a rule $\nabla$ which
assigns to each vector field $v$ on the base space $X$ and each section $s$ a section
$\nabla_v s$, linear in $v$ and $s$, and which satisfies

(i) $\nabla_{fv} s = f \nabla_v s$; and

(ii) $\nabla_v (fs) = (D_v f)s + f \nabla_v s$

for all smooth scalar-valued functions $f$ on $X$.

It follows from property (i) that for each $x \in X$, $(\nabla_v s)(x)$ depends only on the
value of the vector field $v$ at the point $x$.

Let $\nabla$ be a covariant derivative on $E$, and let $\sigma = \{\sigma_1, \ldots, \sigma_n\}$ be a local
basis for $E$, defined over $U \subset X$. If $s = f_1 \sigma_1 + \cdots + f_n \sigma_n$, then by the
linearity of $\nabla$ and the Leibniz property (ii), $\nabla_v s$ is determined by the local sections
$\nabla_v \sigma_1, \ldots, \nabla_v \sigma_n$. Moreover, for each $j = 1, \ldots, n$, the sections $\nabla_v \sigma_j$ are linear
combinations of the sections $\sigma_i$, $i = 1, \ldots, n$, and hence there exist scalars $A^\sigma_{ij}(v)$
such that

$$\nabla_v \sigma_j = \sum_{i=1}^{n} A^\sigma_{ij}(v)\, \sigma_i.$$

By the linearity in $v$, the scalars $A^\sigma_{ij}(v)$ define differential 1-forms $A^\sigma_{ij}$ on $U$; the
matrix $A^\sigma = (A^\sigma_{ij})$ may thus be regarded either as a matrix-valued 1-form or as a matrix
of 1-forms.

If the section $s$ is represented, relative to a local basis $\sigma$, by the vector-valued
function $f = (f_1, \ldots, f_n)$, the section $\nabla_v s$ is represented by the function

$$D_v f + A^\sigma(v) f.$$

Let $\tau = \{\tau_1, \ldots, \tau_n\}$ be another local basis for $E$, defined over $V \subset X$, and let
$C = (c_{ij})$ be the transition matrix from $\sigma$ to $\tau$. Let $A^\tau_{ij}$ denote the 1-forms defined
by $\nabla_v \tau_j = \sum_{i=1}^{n} A^\tau_{ij}(v)\, \tau_i$. Then

$$\nabla_v \tau_j = \sum_{i=1}^{n} A^\tau_{ij}(v)\, \tau_i = \sum_{i=1}^{n} \sum_{k=1}^{n} A^\tau_{ij}(v)\, c_{ki}\, \sigma_k = \sum_{k=1}^{n} \left(C A^\tau(v)\right)_{kj} \sigma_k.$$

On the other hand, we also have

$$\nabla_v \tau_j = \nabla_v \sum_{k=1}^{n} c_{kj} \sigma_k = \sum_{k=1}^{n} (D_v c_{kj})\, \sigma_k + \sum_{k=1}^{n} c_{kj} \nabla_v \sigma_k$$

$$= \sum_{k=1}^{n} (D_v c_{kj})\, \sigma_k + \sum_{k=1}^{n} c_{kj} \sum_{i=1}^{n} A^\sigma_{ik}(v)\, \sigma_i = \sum_{k=1}^{n} \left(D_v C + A^\sigma(v) C\right)_{kj} \sigma_k.$$

It follows that on $U \cap V$ the matrix-valued 1-forms $A^\sigma$ and $A^\tau$ are related by

$$C A^\tau = A^\sigma C + dC,$$

or finally,

$$A^\tau = C^{-1} A^\sigma C + C^{-1} dC,$$

where $dC$ is the matrix-valued 1-form defined by $(dC)(v) = D_v C$.

In this way we have determined all covariant derivatives on $E$. Specifically,
they correspond locally to operators of the form $D + A^\sigma$, where $D$ is the ordinary
derivative and $A^\sigma$ is a matrix-valued 1-form, acting on column vectors by
left multiplication, which transforms under a change of local basis according to
the rule

$$A^\tau = C^{-1} A^\sigma C + C^{-1} dC.$$

The forms $\{A^\sigma\}$ are called the connection forms for the particular covariant
derivative.
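The transformation rule can be checked numerically in a toy setting. The sketch below assumes a one-dimensional base (a single coordinate $x$), a rank 2 bundle, and arbitrarily chosen sample functions for the connection matrix, the transition matrix and the section; none of these come from the text. It builds $A^\tau = C^{-1}A^\sigma C + C^{-1}\,dC$, uses the convention $\tau_j = \sum_i c_{ij}\sigma_i$ (so components transform by $f^\sigma = C f^\tau$), and verifies that the covariant derivative computed in either basis represents the same section.

```python
import math


def mat_mul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mat_vec(P, v):
    return [sum(P[i][k] * v[k] for k in range(2)) for i in range(2)]


def mat_add(P, Q):
    return [[P[i][j] + Q[i][j] for j in range(2)] for i in range(2)]


def inverse(P):
    det = P[0][0] * P[1][1] - P[0][1] * P[1][0]
    return [[P[1][1] / det, -P[0][1] / det], [-P[1][0] / det, P[0][0] / det]]


def d_mat(F, x, h=1e-5):
    """Central-difference derivative of a matrix-valued function."""
    up, down = F(x + h), F(x - h)
    return [[(up[i][j] - down[i][j]) / (2 * h) for j in range(2)] for i in range(2)]


def d_vec(f, x, h=1e-5):
    """Central-difference derivative of a vector-valued function."""
    up, down = f(x + h), f(x - h)
    return [(up[i] - down[i]) / (2 * h) for i in range(2)]


# Sample data (hypothetical): connection matrix, transition matrix, section.
def A_sigma(x):
    return [[0.0, x], [-x, 0.5 * math.sin(x)]]


def C(x):
    return [[2.0 + math.sin(x), x], [0.3, 1.0 + x * x]]


def f_sigma(x):
    return [math.exp(x), math.cos(2.0 * x)]


def A_tau(x):
    """Connection matrix in the tau basis: C^{-1} A_sigma C + C^{-1} dC."""
    C_inv = inverse(C(x))
    return mat_add(mat_mul(C_inv, mat_mul(A_sigma(x), C(x))),
                   mat_mul(C_inv, d_mat(C, x)))


def f_tau(x):
    """Components of the same section in the tau basis."""
    return mat_vec(inverse(C(x)), f_sigma(x))


def cov_deriv(f, A, x):
    """Components D f + A f of the covariant derivative in a given basis."""
    df, Af = d_vec(f, x), mat_vec(A(x), f(x))
    return [df[i] + Af[i] for i in range(2)]
```

Converting the $\tau$-components of the covariant derivative back with $C$ should reproduce the $\sigma$-components, up to the finite-difference error.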

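When $E$ is a hermitian bundle (and this is the case with which we will be
concerned) it is natural to restrict our attention to covariant derivatives $\nabla$ that are
compatible with the inner product on $E$, i.e., that satisfy

$$D_v \langle s_1, s_2 \rangle = \langle \nabla_v s_1, s_2 \rangle + \langle s_1, \nabla_v s_2 \rangle$$

for all sections $s_1$ and $s_2$ of $E$. For such operators we have for any orthonormal
local basis $\sigma$,

$$0 = D_v \langle \sigma_i, \sigma_j \rangle = \langle \nabla_v \sigma_i, \sigma_j \rangle + \langle \sigma_i, \nabla_v \sigma_j \rangle
= \Bigl\langle \sum_{\nu=1}^{n} A^\sigma_{\nu i}(v)\, \sigma_\nu, \sigma_j \Bigr\rangle + \Bigl\langle \sigma_i, \sum_{\mu=1}^{n} A^\sigma_{\mu j}(v)\, \sigma_\mu \Bigr\rangle
= A^\sigma_{ji}(v) + \overline{A^\sigma_{ij}(v)},$$

which implies that $\overline{A^\sigma_{ij}} = -A^\sigma_{ji}$ for all $i$ and $j$. Thus when $\nabla$ is compatible with the
hermitian inner product on $E$, the matrices $A^\sigma(v)$ are skew-hermitian; in particular,
when $n = 1$ they are pure imaginary numbers.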

We can use the above discussion to rewrite the Schrödinger equation for a
charged particle in an electromagnetic field in terms of covariant derivative
operators and sections.

Let $E$ be a hermitian line bundle over space-time $\mathbb{R}^4$. Let $\{e_0, e_1, e_2, e_3\}$ be
the standard basis for $\mathbb{R}^4$, so that $(x, t) = x_1 e_1 + x_2 e_2 + x_3 e_3 + t e_0$. Let $\sigma = \{\sigma_1\}$
be a local (unitary) basis for $E$ over $U \subset \mathbb{R}^4$; since $E$ is trivial we may assume
that $U = \mathbb{R}^4$.

Fix a covariant derivative with connection form $A^\sigma$ on $E$, compatible with the
hermitian structure, and consider the equation

$$i \nabla_{e_0} s = -\frac{1}{2m} \left(\nabla_{e_1}^2 + \nabla_{e_2}^2 + \nabla_{e_3}^2\right) s, \tag{18.3.1}$$

where the unknown $s$ is a section of $E$. If we define real-valued functions $A_1$, $A_2$,
$A_3$ and $\phi$ on $\mathbb{R}^4$ by

$$-iqA_1 = A^\sigma(e_1), \quad -iqA_2 = A^\sigma(e_2), \quad -iqA_3 = A^\sigma(e_3), \quad \text{and} \quad iq\phi = A^\sigma(e_0),$$

then the equation (18.3.1) goes over to the Schrödinger equation (18.1.1) (with $\hbar$
set equal to 1) for a particle with mass $m$ and charge $q$, with $\Psi$ replaced by $\Psi^\sigma$.
If we choose a different trivialization $\tau$, related to $\sigma$ by $\tau = c\sigma$, then since $c$ and
$A^\sigma$ commute,

$$A^\tau = c^{-1} A^\sigma c + c^{-1} dc = A^\sigma + c^{-1} dc.$$

Letting $f$ be a real-valued function such that $c = e^{-iqf}$, this relation becomes $A^\tau =
A^\sigma - iq\, df$. In other words, the “potentials” $A$ and $\phi$ are replaced by $A + \operatorname{grad} f$ and
$\phi - \partial f/\partial t$, respectively. Moreover, the unknown $\Psi^\sigma$ is replaced by $\Psi^\tau = e^{iqf}\Psi^\sigma$.
Thus we see that the concepts of vector bundle, section and covariant derivative
have led us to essentially the same relationships that arose in connection with the
Schrödinger equation for a charged particle, but with connection forms and their
transformation properties taking the place of vector and scalar potentials.

However, there is still one element missing. In the context of vector bundles 
and covariant derivatives, what could possibly correspond to the electromagnetic 
field (E, B) itself? 

18.4 

One of the great achievements of the first part of the 20th century was Einstein’s 
development of the theory of general relativity, in which the force of gravity was 
understood to be an effect of the local curvature of space-time caused by the pres- 
ence of matter (or more precisely, matter-energy). It turns out that the electro- 
magnetic force can also be viewed as an effect of a kind of curvature. In order to 
understand this we must briefly recall the path by which one arrives at a useful and 
sufficiently general definition of the intrinsic (as opposed to extrinsic) curvature 
of a space at a point. 

Let $M$ be an oriented surface in Euclidean space $\mathbb{R}^3$. For each point $p \in M$,
let $N(p)$ denote the unit normal vector at $p$ determined by the right-hand rule
with respect to the orientation, and let $N : M \to S^2$ be the mapping which
assigns to each $p$ the element $N(p) \in S^2$. The Gaussian curvature $K(p)$ of $M$ at
a point p is the Jacobian determinant of the mapping N at p. Thus the ratio of the 
variation of the normal vectors on a small neighborhood of p to the area of that 
neighborhood approximates K(p). Although it might appear from this definition 
that the Gaussian curvature of a surface in space is an extrinsic concept, which 
depends on the relation of the surface to the surrounding space, it can also be 
defined in a purely intrinsic fashion, using only the metric properties inherited by 
the surface from the Euclidean metric on $\mathbb{R}^3$. One way to do this is to consider
a curvilinear triangle on the surface, whose sides are geodesic arcs, and compute 
the sum of the angles of the triangle. If this sum does not equal 180 degrees, 
it indicates the presence of curvature in the interior of the triangle. In another 
approach one parallel transports a vector that is tangent to the surface around a 
closed loop; if the end result does not match the initial vector, this again indicates
the presence of curvature inside the loop. (Of course, this presupposes that we 
can give an intrinsic meaning to the term “parallel transport” of a tangent vector 
on M.) 

Riemann found a far-reaching generalization of the concept of the curvature of 
a surface, valid for what are now known as Riemannian manifolds. A Riemannian
manifold $M$ is a smooth $n$-dimensional manifold, each of whose tangent
spaces $T_x M$ is supplied with an inner product $\langle\ ,\ \rangle_x$ which varies smoothly with
$x$. A covariant derivative $\nabla$ on $TM$ is said to be “compatible with the metric” if
$D_v \langle X, Y \rangle = \langle \nabla_v X, Y \rangle + \langle X, \nabla_v Y \rangle$ for all smooth vector fields $X$ and $Y$ on $M$.

One begins by showing that on the tangent bundle $TM$ of a Riemannian
manifold there is a unique covariant derivative $\nabla$ such that

(i) $\nabla$ is compatible with the metric on $TM$; and

(ii) $\nabla_X Y - \nabla_Y X = [X, Y]$ for all vector fields $X$ and $Y$ on $M$.

Here [X, Y] is the usual commutator bracket of vector fields. Such a covariant 
derivative is often referred to as the “Levi-Civita connection” for the given
Riemannian manifold. For example, for a surface embedded in $\mathbb{R}^3$ this unique
covariant derivative can be shown to be the ordinary derivative $D$ on $\mathbb{R}^3$, followed
by orthogonal projection onto the surface.

The Riemann curvature tensor $R(p)$ is then defined by associating to each triple
$u$, $v$ and $X$ of tangent vectors at $p$ the tangent vector

$$R(u, v)X = \nabla_{\tilde{u}}\bigl(\nabla_{\tilde{v}}(\tilde{X})\bigr) - \nabla_{\tilde{v}}\bigl(\nabla_{\tilde{u}}(\tilde{X})\bigr) - \nabla_{[\tilde{u}, \tilde{v}]}(\tilde{X}).$$

Here $\tilde{u}$, $\tilde{v}$, and $\tilde{X}$ are arbitrary extensions of $u$, $v$ and $X$ to smooth vector fields
on a neighborhood of $p$. We will not explain how one arrives at this definition,
except to say that, roughly speaking, the expression on the right-hand side
represents the difference between $X$ and the vector $X'$ obtained by parallel transport
of $X$ around an “infinitesimally small parallelogram” spanned by $u$ and $v$, where
parallel transport of a tangent vector $X$ along a curve $\gamma(t)$ is defined by requiring
that $\nabla_{\gamma'(t)} X = 0$.

It follows immediately from the definition of R(p) that it is linear in u, v and 
X, and it is not difficult to show with the aid of properties (i) and (ii) in the 
definition of a covariant derivative that it is independent of how we extend u, v 
and $X$ to vector fields. Also, it is obvious that $R(u, v)(X) = -R(v, u)(X)$. In
other words, one may regard R(p) as an exterior 2-form on M whose values are 
linear transformations on TpM. 

The Riemann curvature tensor has an immediate generalization to vector
bundles. Let $E$ be a vector bundle over a smooth manifold $M$, and let $\nabla$ be a covariant
derivative on $E$. The curvature of $\nabla$ is the exterior 2-form on $M$, whose values
are linear transformations on the fibers of $E$, defined by

$$F(u, v)e = \nabla_{\tilde{u}}\bigl(\nabla_{\tilde{v}}(s)\bigr) - \nabla_{\tilde{v}}\bigl(\nabla_{\tilde{u}}(s)\bigr) - \nabla_{[\tilde{u}, \tilde{v}]}(s).$$

Here $e$ is an element of the fiber $E_x$, $u$ and $v$ are tangent vectors at $x \in M$, $\tilde{u}$, $\tilde{v}$
are vector fields which extend $u$ and $v$, and $s$ is a section of $E$ such that $s(x) = e$.
As before, this definition is independent of the choices of the extensions $\tilde{u}$, $\tilde{v}$ and
$s$. The geometric interpretation is essentially the same as when $E$ is the tangent
bundle of $M$, except that now it is a measure of the effect on an element $e \in E_x$
of parallel transport around an “infinitesimal parallelogram” in $M$, and where a
section $s$ is said to be parallel over a path $\gamma(t)$ in $M$ if $\nabla_{\gamma'(t)} s = 0$ for all $t$.

Note that in contrast to the Riemann curvature tensor on a Riemannian manifold 
M, which is determined by the metric on M, the curvature now depends on the 
connection one chooses. 

Let $\sigma$ be a local basis for $E$, and let $F^\sigma(u, v)$ be the matrix, relative to the local basis
$\sigma$, of the linear transformation $e \mapsto F(u, v)(e)$. By using the connection 1-form $A^\sigma$
of $\nabla$ to compute $F(u, v)(\sigma_j(x))$ one finds that

$$F^\sigma(u, v) = dA^\sigma(u, v) + (A^\sigma \wedge A^\sigma)(u, v).$$

Here $dA^\sigma$ is the usual exterior derivative of the 1-form $A^\sigma$ and $A^\sigma \wedge A^\sigma$ is the
exterior product of $A^\sigma$ with itself, defined by $(A^\sigma \wedge A^\sigma)(u, v) = A^\sigma(u) A^\sigma(v) -
A^\sigma(v) A^\sigma(u)$. Note that in contrast to the case for scalar-valued forms, $A^\sigma \wedge A^\sigma$ is
in general not zero, since matrix multiplication is non-commutative. However, for
real or complex line bundles the connection forms have scalar values, so that in
this case

$$F^\sigma = dA^\sigma.$$

Let $\tau$ be another local basis for $E$, and let $C$ be the transition matrix from $\sigma$ to
$\tau$. Then a rather tedious computation using the relation $A^\tau = C^{-1} A^\sigma C + C^{-1} dC$
shows that $F^\tau$ is related to $F^\sigma$ by

$$F^\tau = C^{-1} F^\sigma C.$$

In particular, when $E$ is a line bundle, this shows that the curvature 2-forms $F^\sigma$
are independent of the choice of local basis $\sigma$.

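Let us work out the curvature forms $F^\sigma$ for the covariant derivative on a
complex line bundle over $\mathbb{R}^4$ defined in the previous section. With respect to the basis
$\{e_0, e_1, e_2, e_3\}$ we have

$$A^\sigma = -iq\,(A_1\, dx_1 + A_2\, dx_2 + A_3\, dx_3 - \phi\, dt).$$

Therefore,

$$F^\sigma = dA^\sigma = -iq \Bigl( \frac{\partial A_1}{\partial x_2}\, dx_2\, dx_1 + \frac{\partial A_1}{\partial x_3}\, dx_3\, dx_1 + \frac{\partial A_1}{\partial t}\, dt\, dx_1 + \cdots
- \frac{\partial \phi}{\partial x_1}\, dx_1\, dt - \frac{\partial \phi}{\partial x_2}\, dx_2\, dt - \frac{\partial \phi}{\partial x_3}\, dx_3\, dt \Bigr),$$

i.e.,

$$F^\sigma = -iq \Bigl( (\operatorname{curl} A)_3\, dx_1\, dx_2 + (\operatorname{curl} A)_2\, dx_3\, dx_1 + (\operatorname{curl} A)_1\, dx_2\, dx_3
- \Bigl(\frac{\partial A_1}{\partial t} + \frac{\partial \phi}{\partial x_1}\Bigr) dx_1\, dt - \Bigl(\frac{\partial A_2}{\partial t} + \frac{\partial \phi}{\partial x_2}\Bigr) dx_2\, dt - \Bigl(\frac{\partial A_3}{\partial t} + \frac{\partial \phi}{\partial x_3}\Bigr) dx_3\, dt \Bigr).$$

Since $A$ and $\phi$ are vector and scalar potentials for the field $(E, B)$, this shows that

$$F^\sigma = -iq\,(E_1\, dx_1\, dt + E_2\, dx_2\, dt + E_3\, dx_3\, dt + B_1\, dx_2\, dx_3
+ B_2\, dx_3\, dx_1 + B_3\, dx_1\, dx_2).$$

Since $F^\sigma$ does not depend on $\sigma$, this means that the coefficients of the curvature
2-form $F$ of $\nabla$ are just (except for the factor $-iq$) the components of the given
electromagnetic field. Thus we have completed the geometric picture begun in
the previous section: the vector and scalar potentials correspond to a covariant
derivative, and the electromagnetic field corresponds to the curvature 2-form of
the covariant derivative multiplied by $i/q$.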

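As for Maxwell's equations, one finds by computing the exterior derivative

\[ dF^\sigma = -iq\Big( \frac{\partial E_1}{\partial x_2}\,dx_2\,dx_1\,dt + \frac{\partial E_1}{\partial x_3}\,dx_3\,dx_1\,dt + \frac{\partial E_2}{\partial x_1}\,dx_1\,dx_2\,dt + \cdots + \frac{\partial B_2}{\partial t}\,dt\,dx_3\,dx_1 + \frac{\partial B_3}{\partial x_3}\,dx_3\,dx_1\,dx_2 + \frac{\partial B_3}{\partial t}\,dt\,dx_1\,dx_2 \Big) \]

that the two homogeneous equations

\[ \operatorname{div} B = 0 \quad\text{and}\quad \operatorname{curl} E = -\frac{\partial B}{\partial t} \]

are equivalent to the single equation $dF^\sigma = 0$.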
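To make the equivalence concrete, the terms of $dF^\sigma$ can be grouped by basis 3-form. The grouping below is our own bookkeeping, a sketch that uses only the expression for $F^\sigma$ in terms of $E$ and $B$ and the antisymmetry of the exterior product; the two remaining components follow by cyclic permutation of the indices.

```latex
% Coefficient of dx_1 dx_2 dx_3 (from the B_i terms only):
\frac{\partial B_1}{\partial x_1} + \frac{\partial B_2}{\partial x_2}
  + \frac{\partial B_3}{\partial x_3} = \operatorname{div} B .

% Coefficient of dx_1 dx_2 dt (from E_1 dx_1 dt, E_2 dx_2 dt and B_3 dx_1 dx_2):
\frac{\partial E_2}{\partial x_1} - \frac{\partial E_1}{\partial x_2}
  + \frac{\partial B_3}{\partial t}
  = (\operatorname{curl} E)_3 + \frac{\partial B_3}{\partial t} .

% Hence dF^\sigma = 0 holds exactly when
\operatorname{div} B = 0
\quad\text{and}\quad
\operatorname{curl} E + \frac{\partial B}{\partial t} = 0 .
```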

The remaining two equations

\[ \operatorname{div} E = \rho \quad\text{and}\quad \operatorname{curl} B = j + \frac{\partial E}{\partial t} \]

are equivalent to the equation $\delta F^\sigma = -iqJ^\sigma$, where $\delta$ is the adjoint of $d$ with
respect to the metric on forms on space-time and $J^\sigma = j_1\,dx_1 + j_2\,dx_2 + j_3\,dx_3 - \rho\,dt$
is the “four-current” density. This may be shown by using the fact that $\delta = *\,d\,*$,
where $*$ is the Hodge star operator, defined for differential forms on space-time by

\[ *dx_1 = -dx_2\,dx_3\,dt, \quad *dx_1\,dx_2 = -dx_3\,dt, \quad *dx_1\,dx_2\,dx_3 = -dt, \]
\[ *dt = -dx_1\,dx_2\,dx_3, \quad *dt\,dx_1 = -dx_2\,dx_3, \quad *dt\,dx_1\,dx_2 = -dx_3, \]

with the remaining formulas being obtained by cyclic permutation of $x_1$, $x_2$, and $x_3$.



18.5 

Let us reflect a bit on the results of the previous section. With the aid of the 
concepts of vector bundles and sections, we have found an interesting geomet- 
ric interpretation of electromagnetic potentials and forces in terms of covariant 
derivatives and their curvature.

One implication of this is the following. Let $\nabla$ be a covariant derivative on a
hermitian line bundle $E$ over $\mathbb{R}^4$. As we have seen, if $s$ is a section of $E$ which
satisfies a Schrödinger equation $i\nabla_{e_0}s = (-1/2m)\sum_{j=1}^{3}\nabla_{e_j}\nabla_{e_j}s$, then for
each local trivialization $\sigma$ of $E$, the representative $\Psi^\sigma$ of $s$ satisfies the equation
$i(\partial/\partial t + i\phi)\Psi^\sigma = (-1/2m)(\operatorname{grad} - iA)^2\Psi^\sigma$, where $A_i = iA^\sigma(e_i)$, $i = 1,2,3$, and
$\phi = -iA^\sigma(e_0)$. Since this equation is the Schrödinger equation for a particle of
unit charge moving in an electromagnetic field with vector and scalar potentials $A$
and $\phi$, we see that the geometric point of view contains within itself the necessity
of electromagnetic potentials.

Of course, the physical reality of electromagnetism can only be demonstrated 
by experiment; geometry and physics occupy different domains. Still, there is an- 
other consideration which should not be overlooked. As we have already noted, 
the gravitational force can be viewed as an effect of the curvature of space-time 
due to the presence of matter. There is a covariant derivative involved here also, 
namely, the Levi-Civita covariant derivative associated with the Lorentz metric 
on the space-time manifold M. In light of this, the fact that the other fundamental 
physical force which we experience directly can be expressed in terms of the cur- 
vature of a covariant derivative is rather startling. Among other things, it suggests 
the possibility that a unified theory of electromagnetism and gravity may someday 
be found. 

Remark. It may be appropriate to mention at this point that the term “gauge,” 
as in “choice of gauge,” has its origin in a suggestion made in 1919 by Hermann 
Weyl. Weyl thought that perhaps one should relax the condition on the covari- 
ant derivative on the space-time manifold M slightly and allow the possibility 
that the lengths of tangent vectors could change under parallel transport, with the 
consequence that there would be no fixed gauge (Eich, in German) or standard 
of length. Einstein objected that if this were the case, then hydrogen atoms with 
different histories would not have identical spectra, and the idea was temporarily 
abandoned. A few years later, with the appearance of quantum wave functions
$\Psi$, Weyl and others realized that his idea that arbitrary gauge changes be allowed
could be applied instead to the phases $\theta(x, t)$ of a quantum wave function $\Psi$.

There are implications in the idea of allowing arbitrary gauge changes in quan- 
tum wave functions which go well beyond electromagnetism. Recall that there are 
two other fundamental forces in nature, the weak nuclear force, which is responsible
for $\beta$-decay, and the strong nuclear force, which holds the nucleus together.
Is it possible that these forces can be interpreted in terms of connections and cur- 
vature as well? 

First, consider the weak nuclear force. This force does not “see” the difference 
between protons and neutrons. Heisenberg had suggested in 1932 that perhaps the 
proton and neutron were just different states of a single “nucleon,” analogous to 
the “spin up” and “spin down” states of the electron. If this is the case, then the 
wave function $\Psi$ for the nucleon could be regarded as a pair of complex-valued
functions $(\Psi_1, \Psi_2)$, with $|\Psi_1(x,t)|^2$ being the relative probability of finding the
nucleon at $(x, t)$ in the “proton state” and $|\Psi_2(x,t)|^2$ being the relative probability
of finding it at $(x, t)$ in the “neutron state.” In this way $\Psi$ could be viewed as
a function on space-time with values in two-dimensional complex space $\mathbb{C}^2$. Of
course, which state one chooses to call the proton state and which to call the
neutron state is rather arbitrary, but it was assumed that once one had made such a
choice at one point $(x, t)$, then there would be no freedom of choice left for other
points.

Figure 18.3: Isotopic spin

In 1954 Yang and Mills proposed that, in analogy with the situation in electrodynamics,
one should be able to make the choice of proton and neutron states in an
arbitrary way, without being restricted by the choice made at one particular point
(see Figure 18.3, in which the situations for wave functions with values in $\mathbb{C}$ and
in $\mathbb{C}^2$ are illustrated). Thus from their viewpoint the “isotopic spin axes” in the
space $\mathbb{C}^2$ could be chosen arbitrarily.

To put this another way, since the metric on $\mathbb{C}^2$ is invariant under $SU(2)$, the
wave functions $\Psi$ and $g\Psi$, where $g = g(x, t)$ is a function whose values are elements
of the special unitary group $SU(2)$, should be regarded as representing the
same physical state. This assumption leads, just as in electrodynamics, directly to
the understanding that the wave function $\Psi$ should be replaced by a section $s$ of a
trivial hermitian vector bundle $E$ of dimension 2 over $\mathbb{R}^4$.
For each choice $\sigma = \{\sigma_1, \sigma_2\}$ of a basis for $E$, $s$ will be represented by a
function $\Psi^\sigma : \mathbb{R}^4 \to \mathbb{C}^2$; if $\tau$ is another trivialization, then $\Psi^\tau = C^{-1}\Psi^\sigma$, where
$C = C(x, t)$ is the $2 \times 2$ transition matrix which relates $\tau$ to $\sigma$. The Schrödinger
equation for $s$, which can be written in terms of a covariant derivative $\nabla$ on $E$,
corresponds via $\sigma$ to a Schrödinger equation for $\Psi^\sigma$ in which the ordinary derivative
$D$ is replaced by $D + A^\sigma$, the connection forms $A^\sigma$ now being 1-forms on
$\mathbb{R}^4$ whose values are $2 \times 2$ traceless skew-hermitian matrices, i.e., elements of the
Lie algebra of $SU(2)$. The field $A^\sigma$, which may be thought of as a “generalized
potential” for the weak force, may be written in the form $A^\sigma = \sum_{a=1}^{3} A^\sigma_a \xi_a$, where
the $A^\sigma_a$ are real-valued 1-forms on $\mathbb{R}^4$ and $\{\xi_1, \xi_2, \xi_3\}$ is a basis for the Lie algebra
of $SU(2)$; one usually takes the $\xi_a$ to be the Pauli matrices multiplied by $i$, so that

\[ \xi_1 = i\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \xi_2 = i\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \xi_3 = i\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \]

Thus $A^\sigma$ may be considered as a field on space-time with 12 components, in
contrast to the four components of the potentials $(A, \phi)$ of electromagnetism. The
transformation law which relates potentials relative to different trivializations of
$E$ is just as worked out in Section 18.3, namely

\[ A^\tau = C^{-1}A^\sigma C + C^{-1}dC. \]

Here the transition matrix (or “gauge change”) may be written as

\[ C = \exp\Big( \sum_{a=1}^{3} c_a \xi_a \Big), \]

where the $c_a$ are real-valued functions on $\mathbb{R}^4$.
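The claims about the basis $\xi_a$ can be checked by direct matrix arithmetic. The following is a minimal sketch in plain Python; the helper names (`matmul`, `commutator`, `is_traceless_skew_hermitian`) are our own and are not part of the text. It confirms that each $\xi_a = i\sigma_a$ is traceless and skew-hermitian and that $[\xi_1, \xi_2] = -2\xi_3$.

```python
# Illustrative check of the su(2) basis xi_a = i * sigma_a (sigma_a = Pauli matrices).
# All helper names are hypothetical; matrices are 2x2 nested lists of complex numbers.

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def scale(c, a):
    """Scalar multiple c * a."""
    return [[c * a[i][j] for j in range(2)] for i in range(2)]

def sub(a, b):
    """Entrywise difference a - b."""
    return [[a[i][j] - b[i][j] for j in range(2)] for i in range(2)]

def dagger(a):
    """Conjugate transpose."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]

def trace(a):
    return a[0][0] + a[1][1]

def commutator(a, b):
    """Matrix commutator [a, b] = ab - ba."""
    return sub(matmul(a, b), matmul(b, a))

def is_traceless_skew_hermitian(a):
    """True when tr(a) = 0 and a^dagger = -a, i.e. a lies in su(2)."""
    return trace(a) == 0 and dagger(a) == scale(-1, a)

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
XI = [scale(1j, s) for s in SIGMA]

if __name__ == "__main__":
    print(all(is_traceless_skew_hermitian(x) for x in XI))
    print(commutator(XI[0], XI[1]) == scale(-2, XI[2]))
```

Since the three $\xi_a$ are linearly independent over the reals, a 1-form with values in their span has $3 \times 4 = 12$ real component functions on $\mathbb{R}^4$, which is the count quoted above.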

The weak force then appears as the curvature 2-form $F$ of $\nabla$. Thus with respect
to a trivialization $\sigma$ of $E$, the weak force will be represented by a 2-form $F^\sigma = dA^\sigma + A^\sigma \wedge A^\sigma$
whose values are $2 \times 2$ traceless skew-hermitian matrices, with the
transformation law

\[ F^\tau = C^{-1}F^\sigma C. \]

We leave it to the reader to work out the components $F^\sigma_a$ of $F^\sigma$ in terms of the
components $A^\sigma_a$ of the potential. In terms of real-valued functions on $\mathbb{R}^4$, the weak
force is a field with 18 components, as compared with the six components of
electromagnetic fields.

One would also like to determine the field equations for this force, i.e., the
analogue of Maxwell’s equations. Unlike the case of hermitian line bundles, it
is not true for vector bundles of dimension greater than 1 that $dF^\sigma = 0$. This
equation must be replaced instead by the equation

\[ dF^\sigma + A^\sigma \wedge F^\sigma - F^\sigma \wedge A^\sigma = 0, \]

18. P. Shanahan: Schrödinger’s Equation and Gauge Theory

which may be written as

\[ dF^\sigma + [A^\sigma, F^\sigma] = 0, \tag{18.5.1} \]

where $[\ ,\ ]$ is the commutator bracket of matrices. It is not difficult to verify that
this equation is a simple consequence of the relation $F^\sigma = dA^\sigma + A^\sigma \wedge A^\sigma$. (To
geometers it is known as the Bianchi identity for curvature.)

If we define the covariant differential associated to $A^\sigma$ by $d^{A^\sigma} = d + [A^\sigma,\ \cdot\ ]$,
then (18.5.1) becomes simply

\[ d^{A^\sigma}F^\sigma = 0. \tag{18.5.1'} \]
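The verification of the Bianchi identity mentioned above takes only a few lines. The sketch below writes $A$ and $F$ for the connection and curvature forms in a fixed trivialization (an abbreviation of ours), and assumes only $F = dA + A \wedge A$, $d^2 = 0$, and the graded Leibniz rule for the matrix-valued 1-form $A$.

```latex
% Step 1: differentiate F = dA + A \wedge A, using d(dA) = 0 and
% d(A \wedge A) = dA \wedge A - A \wedge dA for the 1-form A:
dF = dA \wedge A - A \wedge dA .

% Step 2: expand the two correction terms:
A \wedge F - F \wedge A
  = A \wedge dA + A \wedge A \wedge A - dA \wedge A - A \wedge A \wedge A
  = A \wedge dA - dA \wedge A .

% Step 3: add the two results:
dF + A \wedge F - F \wedge A = 0 .
```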

In addition to the Bianchi identity, the 2-form $F^\sigma$ satisfies an equation analogous
to the equation $\delta F^\sigma = -iqJ^\sigma$ which comprises the non-homogeneous pair of
Maxwell’s equations in the case of electromagnetism. However, in contrast to the
situation in electromagnetic theory, where Maxwell’s equations were known in
advance, we have no such guide in the case of the weak force. The approach used
by Yang and Mills was to appeal to a variational principle for the action density
$|F^\sigma|^2 = F^\sigma \wedge *F^\sigma$. That this is a reasonable choice for an action density is
based partly on the fact that when $F^\sigma$ represents the electromagnetic force, then
$F^\sigma \wedge *F^\sigma = (E^2 - B^2)\,dx_1\,dx_2\,dx_3\,dt$, which is the action density for the source-free
electromagnetic field, and partly on the fact that there are few other choices
that have the correct tensorial properties. By applying Hamilton’s principle in the
usual way, one arrives at the equation

\[ \delta^{A^\sigma}F^\sigma = 0, \tag{18.5.2} \]

where $\delta^{A^\sigma} = *\,d^{A^\sigma}\,*$ is the adjoint of $d^{A^\sigma}$. The equation (18.5.2) is known as the
Yang-Mills equation. (When a source term is included in the action density, the
Yang-Mills equation takes the form $\delta^{A^\sigma}F^\sigma = J^\sigma$, where $J^\sigma = j_1\,dx_1 + j_2\,dx_2 + j_3\,dx_3 - \rho\,dt$
is a 1-form with values in the Lie algebra of $SU(2)$, with $\rho$ being
identified as the “charge” density and $j$ as the “current density” associated to the
source of the weak force field.)

Today it is understood that the proton and neutron are not fundamental particles, 
but are made up of quarks, as are other particles such as pions and kaons. The 
“isospin states” of quarks relative to the weak nuclear force are referred to as 
“flavors”; these occur in three different pairs, “up-down,” “strange-charm” and 
“top-bottom.” The quantum theory of each of these pairs is an $SU(2)$ gauge theory
as described above. 

Relative to the strong nuclear force, quarks have three isospin states, collectively
referred to as “color.” The color states are conventionally termed “red,”
“green” and “blue.” The wave function for quarks which make up the proton and
the neutron (each of which consists of three quarks) is regarded as a section of
a hermitian bundle $E$ of dimension 3. Thus, with respect to a trivialization $\sigma$ of
$E$, the wave function $s$ is represented by a function $\Psi^\sigma = (\Psi^\sigma_1, \Psi^\sigma_2, \Psi^\sigma_3)$, where
$|\Psi^\sigma_1(x,t)|^2$, $|\Psi^\sigma_2(x,t)|^2$ and $|\Psi^\sigma_3(x,t)|^2$ are the relative probabilities of finding the
quark at $(x, t)$ with color red, green or blue, respectively. However, except for the
fact that the gauge group is now $SU(3)$, and the connection and curvature forms
take their values in the Lie algebra of $SU(3)$, i.e., in the space of $3 \times 3$ traceless
skew-hermitian matrices, the geometric formalism we have discussed in connection
with the weak force goes over without any change to the strong force.

To conclude, the object of this talk has been to make the case that the geometric 
point of view, which makes use of the concepts of sections of vector bundles, 
connections and curvature, has a significant role to play in the development of the 
physical theory of the fundamental forces of nature.

Appendix A

Some Further Electron Configurations



Electron configurations of a few light atoms in their ground states were presented
in Table 10.2. We present here a few more ground state configurations.

The number of protons p and neutrons n in the nucleus, and the shell structures
are presented in the following schematics for atoms with atomic numbers
Z, where 1 ≤ Z ≤ 18.



Atom          Symbol    Atomic Number Z    Electron Configuration

Nitrogen      N          7                 1s^2 2s^2 2p^3
Oxygen        O          8                 1s^2 2s^2 2p^4
Sodium        Na        11                 1s^2 2s^2 2p^6 3s^1
Magnesium     Mg        12                 1s^2 2s^2 2p^6 3s^2
Aluminum      Al        13                 1s^2 2s^2 2p^6 3s^2 3p^1
Silicon       Si        14                 1s^2 2s^2 2p^6 3s^2 3p^2
Phosphorus    P         15                 1s^2 2s^2 2p^6 3s^2 3p^3
Sulfur        S         16                 1s^2 2s^2 2p^6 3s^2 3p^4
Chlorine      Cl        17                 1s^2 2s^2 2p^6 3s^2 3p^5
Argon         Ar        18                 1s^2 2s^2 2p^6 3s^2 3p^6
Potassium     K         19                 1s^2 2s^2 2p^6 3s^2 3p^6 4s^1
Calcium       Ca        20                 1s^2 2s^2 2p^6 3s^2 3p^6 4s^2
Scandium      Sc        21                 1s^2 2s^2 2p^6 3s^2 3p^6 3d^1 4s^2

Table A.1: More ground state configurations



Figure A.1: Electron c








Appendix B 

Mendeleev Periodic Table 



Some brief remarks concerning the general set up of the periodic table appear in 
Section 10.3. Here the name, symbol, atomic number, and electron configuration 
of an element is listed. For example, in the table where the box

    chromium      2
    Cr            8
    24           13
                  1

appears, the element is chromium with symbol Cr, atomic number Z = 24, and
which has 2 electrons in the K shell, 8 electrons in the L shell, 13 electrons in the
M shell, and 1 electron in the N shell.

Transition elements (electrons in the K, L, M, N shells):

    scandium     Sc    21     2   8    9   2
    titanium     Ti    22     2   8   10   2
    vanadium     V     23     2   8   11   2
    chromium     Cr    24     2   8   13   1
    manganese    Mn    25     2   8   13   2
    iron         Fe    26     2   8   14   2
    cobalt       Co    27

Further entries of the chart: yttrium (Y, 39), zirconium (Zr, 40), niobium (Nb, 41),
molybdenum (Mo, 42), technetium (Tc, 43), ruthenium (Ru, 44), rhodium (Rh, 45);
lutetium (Lu), hafnium (Hf, 72), Re (75), Os (76); Lw (103), Ku (104), Ha (105).

Figure B.1: Metals




Appendix C 

Determinants for String World-Sheets 
That Are Tori: Another Example of 
Zeta Regularization 



In Chapter 11 (and other chapters) we have considered path integrals (“sums
over histories or paths”) for particles. In string theory, a particle is replaced
by a string and instead of a path integral one considers a “sum over surfaces”
(a Polyakov integral), since a string moving in $d$-dimensional space-time $\mathbb{R}^d = \mathbb{R} \times \mathbb{R}^{d-1}$
sweeps out a surface $\Sigma$, called a world-sheet. One also sums over Riemannian
metrics $\gamma$ on $\Sigma$ and smooth maps $\Phi: \Sigma \to \mathbb{R}^d$. The sum over $\Phi$, with
$(\Sigma, \gamma)$ fixed, is easy to deal with and can be given a mathematical meaning since
it essentially reduces to $[\det(-\Delta(\gamma)')]^{-d/2}$, where $\Delta(\gamma)'$ is the restriction of the
Laplace-Beltrami operator $\Delta(\gamma)$ of $(\Sigma, \gamma)$ (see (16.1.4)) to the orthogonal complement
of the kernel of $\Delta(\gamma)$. That is, formally, $\det(-\Delta(\gamma)')$ = product of non-zero
eigenvalues of $-\Delta(\gamma)$, which can be made precise by the zeta-regularization
method of Chapter 15. Here we take $d = 26$, the so-called critical dimension.

We show how to compute this determinant in a concrete example. Namely we
consider a (Bosonic) string whose world-sheet is a complex torus: $\Sigma = \Sigma_\tau := \mathbb{C}/L_\tau$,
where $L_\tau$ is the lattice $\{a + b\tau \mid a, b \in \mathbb{Z}\}$, where $\tau = \tau_1 + i\tau_2$ is a fixed number
in the upper half-plane: $\tau_2 > 0$. The Euclidean Laplacian $\Delta = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}$ on
$\mathbb{C} = \mathbb{R}^2$ commutes with translations by points of $\mathbb{R}^2$ (by the chain rule), and in
particular with translations by points of $L_\tau$. Thus $\Delta$ projects to an elliptic differential
operator $\Delta_\tau$ on $\Sigma_\tau$, in the same way that the Laplacian $\Delta$ on $G/K$ projected to
the operator $\Delta_\Gamma$ on $\Gamma\backslash G/K$ in Chapter 16. One knows that the canonical metric
on $\mathbb{R}^2$ is the pull-back of a metric $\gamma$ on $\Sigma_\tau$ whose Laplace-Beltrami operator $\Delta(\gamma)$
is exactly $\Delta_\tau$. The spectrum of $\Delta_\tau$ is known and is given as follows. For integers
$m$, $n$ define $\phi_{mn}$ on $\mathbb{C}$ by

\[ \phi_{mn}(z) = \exp\Big( \frac{2\pi i}{\tau_2}\big[\, n\tau_2\,x - (m + n\tau_1)\,y \,\big] \Big) \]

for $z = x + iy$, $\tau = \tau_1 + i\tau_2$. Then one has immediately that

\[ -\Delta\phi_{mn} = \lambda_{mn}\phi_{mn} \tag{C.1.1} \]

for

\[ \lambda_{mn} := \frac{4\pi^2}{\tau_2^2}\,|m + n\tau|^2 \ge 0. \tag{C.1.2} \]

Now $\phi_{mn}: \mathbb{C} \to T = \{z \in \mathbb{C} \mid |z| = 1\}$ is a homomorphism which is trivial on
$L_\tau$. Therefore $\phi_{mn}$ projects to a homomorphism $\tilde\phi_{mn}: \Sigma_\tau \to T$ which satisfies

\[ -\Delta_\tau\tilde\phi_{mn} = \lambda_{mn}\tilde\phi_{mn}. \tag{C.1.3} \]

It is known that $\{\lambda_{mn}\}_{m,n\in\mathbb{Z}}$ is the (multiplicity-free) spectrum of $-\Delta_\tau$ on $L^2(\Sigma_\tau)$.
By (C.1.2), $\lambda_{mn} = 0 \iff m, n = 0$. Therefore the spectral zeta function $\zeta_{-\Delta(\gamma)'}$
of $-\Delta(\gamma)' = -\Delta_\tau'$ (cf. (15.1.6)) is given by



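\[ \zeta_{-\Delta(\gamma)'}(s) := \sum_{(m,n)\ne(0,0)} \frac{1}{\lambda_{mn}^{\,s}} \overset{\text{by (C.1.2)}}{=} \Big(\frac{\tau_2}{4\pi^2}\Big)^{s} E^*(s,\tau), \tag{C.1.4} \]

where

\[ E^*(s,\tau) := \sum_{(m,n)\ne(0,0)} \frac{\tau_2^{\,s}}{|m + n\tau|^{2s}} \tag{C.1.5} \]

is a standard Eisenstein series which is holomorphic in $s$ for $\operatorname{Re} s > 1$ and which
admits an explicit meromorphic continuation to $\mathbb{C}$; cf. [89], Chapter 15 for example.
The only singularity of $s \mapsto E^*(s,\tau)$ is a simple pole at $s = 1$ (with
residue $\pi$). Thus by (C.1.4), (C.1.5) $\zeta_{-\Delta(\gamma)'}$, defined initially for $\operatorname{Re} s > 1$, extends
meromorphically (and explicitly) to $\mathbb{C}$ and, in particular, is holomorphic at $s = 0$,
which means that definition (15.1.9) applies:

\[ \det(-\Delta_\tau') := \exp\big(-\zeta'_{-\Delta_\tau'}(0)\big). \tag{C.1.6} \]

By (C.1.4), (C.1.5) one therefore needs the value $\frac{\partial E^*}{\partial s}(0,\tau)$. But this value is
indeed known, a result which amounts to the classical Kronecker limit formula.
Namely, let $\eta(\tau)$ denote the Dedekind eta function on the upper half-plane:

\[ \eta(\tau) := e^{\pi i\tau/12}\prod_{n=1}^{\infty}\big(1 - e^{2\pi i n\tau}\big). \tag{C.1.7} \]

Then

Theorem C.1.

\[ \frac{\partial E^*}{\partial s}(s,\tau)\Big|_{s=0} = -\log\big(4\pi^2\,\operatorname{Im}\tau\,|\eta(\tau)|^4\big). \]

It follows that (again for $\tau_2 = \operatorname{Im}\tau$)

\[ \zeta'_{-\Delta_\tau'}(0) = -\log\big(\tau_2^2\,|\eta(\tau)|^4\big) \implies \det(-\Delta_\tau') = \tau_2^2\,|\eta(\tau)|^4. \tag{C.1.8} \]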
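The step from Theorem C.1 to (C.1.8) can be spelled out as follows. This is a sketch under two assumptions: the normalization $\zeta_{-\Delta_\tau'}(s) = (\tau_2/4\pi^2)^s E^*(s,\tau)$ for the Eisenstein series $E^*(s,\tau) = \sum_{(m,n)\ne(0,0)} \tau_2^s/|m+n\tau|^{2s}$, and the standard special value $E^*(0,\tau) = -1$, which the text does not state explicitly.

```latex
% Product rule applied to zeta(s) = (tau_2 / 4 pi^2)^s E^*(s, tau) at s = 0:
\zeta'_{-\Delta_\tau'}(0)
  = \log\!\Big(\frac{\tau_2}{4\pi^2}\Big)\,E^*(0,\tau)
    + \frac{\partial E^*}{\partial s}(0,\tau)
  = -\log\!\Big(\frac{\tau_2}{4\pi^2}\Big)
    - \log\!\big(4\pi^2\,\tau_2\,|\eta(\tau)|^4\big)
  = -\log\!\big(\tau_2^{2}\,|\eta(\tau)|^4\big).

% Exponentiating as in the zeta-regularized definition of the determinant:
\det(-\Delta_\tau') = \exp\!\big(-\zeta'_{-\Delta_\tau'}(0)\big)
  = \tau_2^{2}\,|\eta(\tau)|^{4}.
```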

Thus we obtain the contribution $[\det(-\Delta_\tau')]^{-d/2} = [\tau_2^2\,|\eta(\tau)|^4]^{-13}$, for $d = 26$, of
the sum over “embeddings” $\Phi: \Sigma \to \mathbb{R}^d$ in the Polyakov integral, in case $\Sigma$ is
the complex torus $\Sigma_\tau = \mathbb{C}/L_\tau$. This one-loop contribution was found by Joseph
Polchinski (and others). One can view 26 as 24 + 2 where $\eta(\tau)^{24}$ is a cusp form
of weight 12 (in the theory of automorphic forms), the smallest weight possible
for the existence of cusp forms (for $SL(2, \mathbb{Z})$). The “2” represents two degrees of
freedom in a certain context. One also arrives at the dimension $d = 26 = 24 + 2$
via the spectral analysis of a Bosonic string (and by many other ways) where the
special value $\zeta(-1) = -\frac{1}{12}$ of the Riemann zeta function $\zeta$ comes into play. That
is, it can be shown that Lorentz invariance of the string requires that $\frac{d-2}{2}\,\zeta(-1) = -1$,
which forces $d = 26$.
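The quantities in this appendix are easy to evaluate numerically. The sketch below is illustrative only: the function names and the truncation length `terms` are our own choices, the determinant is taken in the form $\tau_2^2|\eta(\tau)|^4$, and the critical dimension is obtained by solving $\frac{d-2}{2}\zeta(-1) = -1$ with $\zeta(-1) = -1/12$.

```python
import cmath
import math
from fractions import Fraction

def dedekind_eta(tau, terms=200):
    """Truncated product e^{pi i tau/12} * prod_{n>=1} (1 - e^{2 pi i n tau}), Im(tau) > 0."""
    if tau.imag <= 0:
        raise ValueError("tau must lie in the upper half-plane")
    q = cmath.exp(2j * math.pi * tau)
    product = 1.0 + 0j
    q_power = 1.0 + 0j
    for _ in range(terms):
        q_power *= q              # q_power = q^n
        product *= (1 - q_power)
    return cmath.exp(1j * math.pi * tau / 12) * product

def torus_determinant(tau):
    """Zeta-regularized determinant tau_2^2 |eta(tau)|^4 for the torus C/L_tau."""
    return tau.imag ** 2 * abs(dedekind_eta(tau)) ** 4

def determinant_per_area(tau):
    """tau_2 |eta(tau)|^4: the determinant divided by the area tau_2 of the torus."""
    return torus_determinant(tau) / tau.imag

def critical_dimension():
    """Solve ((d - 2)/2) * zeta(-1) = -1 exactly, with zeta(-1) = -1/12."""
    zeta_minus_one = Fraction(-1, 12)
    return 2 + 2 * Fraction(-1) / zeta_minus_one

if __name__ == "__main__":
    tau = 0.3 + 0.8j
    print(torus_determinant(tau))
    print(determinant_per_area(tau), determinant_per_area(-1 / tau))
    print(critical_dimension())
```

The combination $\tau_2|\eta(\tau)|^4$ is unchanged under $\tau \mapsto \tau + 1$ and $\tau \mapsto -1/\tau$, which gives a convenient sanity check on the truncated product.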




Appendix D 

Evaluation of the Integral

\[ I_n = \int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{-\alpha(|x|+|y|)}}{|x-y|^{n-\beta}}\,dx\,dy \]



Although we need only the value of the integral $I_n$ in case $n = 3$ and $\beta = 2$, to
complete the analysis in Chapter 10, we indicate how to compute $I_n$ in general.
Here $\alpha, \beta > 0$ with $\beta < n$.

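Consider the function

\[ \Phi(x) = \frac{1}{|x|^{\beta}\,(|x|^2 + A)^{\frac{n+1}{2}}} \quad\text{on } \mathbb{R}^n \text{ for } A > 0,\ \beta \in \mathbb{R}. \tag{D.1.1} \]

We claim that $\Phi \in L^1(\mathbb{R}^n, dx)$ for $-1 < \beta < n$. To see this we use polar coordinates.
For $x \in \mathbb{R}^n - \{0\}$, write $x = r\sigma$, where $r := |x| > 0$, $\sigma := x/|x| \in S_1(0) = \{y \in \mathbb{R}^n \mid |y| = 1\}$.
Then Lebesgue measure $dx$ assumes the form $dx = r^{n-1}\,dr\,d\sigma$. We
need also that the surface area of $S_1(0)$ is given by

\[ \sigma_n = \frac{2\pi^{n/2}}{\Gamma(n/2)} \tag{D.1.2} \]

and that (by [40, p. 295])

\[ \int_0^\infty r^{\mu-1}(A + r^2)^{\nu-1}\,dr < \infty \quad\text{for } \mu > 0,\ \nu + \frac{\mu}{2} < 1,\ A > 0; \tag{D.1.3} \]

the value of the integral in (D.1.3) is $\frac{A^{\nu-1+\mu/2}}{2}\,B\big(\frac{\mu}{2},\, 1 - \nu - \frac{\mu}{2}\big)$. Then

\[ \int_{\mathbb{R}^n} \Phi(x)\,dx = \int_{S_1(0)}\int_0^\infty \frac{r^{n-1}\,dr\,d\sigma}{r^{\beta}(r^2 + A)^{\frac{n+1}{2}}} = \sigma_n \int_0^\infty r^{n-\beta-1}(A + r^2)^{-\frac{n+1}{2}}\,dr < \infty \tag{D.1.4} \]

by (D.1.3) (with $\mu = n - \beta$, $\nu = \frac{1-n}{2}$) for $n > \beta > -1$, which proves the claim.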

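For $f \in L^1(\mathbb{R}^n, dx)$, we take the following definition for its Fourier transform $\hat f$:

\[ \hat f(x) = \int_{\mathbb{R}^n} e^{ix\cdot y} f(y)\,dy; \tag{D.1.5} \]

cf. definition (6.2.20). The following result is standard; see [34] for example.

Theorem D.1. (i) For $f(x) = e^{-\alpha|x|}$, $\alpha > 0$, $\hat f(x)$ is given by

\[ \hat f(x) = \frac{\alpha c_n}{(\alpha^2 + |x|^2)^{\frac{n+1}{2}}} \quad\text{for } c_n := 2^n \pi^{\frac{n-1}{2}}\,\Gamma\Big(\frac{n+1}{2}\Big). \tag{D.1.6} \]

(ii) For $g(x) := e^{-t|x|^2}$, $t > 0$,

\[ \hat g(x) = \Big(\frac{\pi}{t}\Big)^{n/2} e^{-\frac{|x|^2}{4t}}. \tag{D.1.7} \]

(iii) For any $f, g \in L^1(\mathbb{R}^n, dx)$

\[ \int_{\mathbb{R}^n} \hat f(x)\,g(x)\,dx = \int_{\mathbb{R}^n} f(x)\,\hat g(x)\,dx. \tag{D.1.8} \]

(D.1.8) follows immediately by Fubini’s theorem.

Define

\[ f(x) := e^{-\alpha|x|} \text{ for } \alpha > 0 \text{ (as in (i) of Theorem D.1), and}\quad h(x) := \frac{\hat f(x)}{|x|^{\beta}}, \quad -1 < \beta < n. \tag{D.1.9} \]

The main computation of this appendix is a Fourier transform formula for $h$, for
$0 < \beta < n$. Note first that for the choice $A = \alpha^2$ we have that $h = \alpha c_n \Phi$ by
definition (D.1.1) and Theorem D.1. As we have shown in equation (D.1.4) that
$\Phi \in L^1(\mathbb{R}^n, dx)$ for $-1 < \beta < n$ we see that $\hat h$ is indeed well defined: $h \in L^1(\mathbb{R}^n, dx)$:

Theorem D.2. For $h$ given by definition (D.1.9) one has that $h \in L^1(\mathbb{R}^n, dx)$.
Moreover, the following formula holds for $0 < \beta < n$:

\[ \hat h(x) = \pi^{n/2}\,2^{n-\beta}\,\frac{\Gamma\big(\frac{n-\beta}{2}\big)}{\Gamma\big(\frac{\beta}{2}\big)} \int_{\mathbb{R}^n} \frac{e^{-\alpha|y-x|}}{|y|^{n-\beta}}\,dy. \tag{D.1.10} \]

Indeed the integral in (D.1.10) exists for $\alpha, \beta > 0$.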

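Proof. Apply the formula for the gamma function

\[ \int_0^\infty e^{-\mu t}\,t^{\nu-1}\,dt = \frac{\Gamma(\nu)}{\mu^{\nu}} \quad\text{for } \operatorname{Re}\mu,\ \operatorname{Re}\nu > 0 \tag{D.1.11} \]

to obtain

\[ \int_0^\infty e^{-|y|^2 t}\,t^{\frac{\beta}{2}-1}\,dt = \frac{\Gamma(\beta/2)}{|y|^{\beta}} \quad\text{for } y \ne 0,\ \beta > 0. \tag{D.1.12} \]

For $c(\beta) := \Gamma(\beta/2)$, $0 < \beta < n$,

\[ c(\beta)\hat h(x) = c(\beta)\int_{\mathbb{R}^n} e^{ix\cdot y}h(y)\,dy = \int_{\mathbb{R}^n} e^{ix\cdot y}\hat f(y)\,\frac{c(\beta)}{|y|^{\beta}}\,dy \overset{\text{(D.1.12)}}{=} \int_{\mathbb{R}^n}\int_0^\infty e^{ix\cdot y}\hat f(y)\,e^{-|y|^2 t}\,t^{\frac{\beta}{2}-1}\,dt\,dy. \tag{D.1.13} \]

Here we wish to apply Fubini’s theorem. Thus note that

\[ \int_{\mathbb{R}^n}\int_0^\infty \Big| e^{ix\cdot y}\hat f(y)\,e^{-|y|^2 t}\,t^{\frac{\beta}{2}} \Big|\,\frac{dt}{t}\,dy \overset{\text{by (D.1.12)}}{=} c(\beta)\int_{\mathbb{R}^n} \frac{|\hat f(y)|}{|y|^{\beta}}\,dy = c(\beta)\int_{\mathbb{R}^n} |h(y)|\,dy < \infty, \tag{D.1.14} \]

where $dt/t$ = Haar measure on $(0, \infty)$, so that Fubini’s theorem does apply:

\[ c(\beta)\hat h(x) = \int_0^\infty \int_{\mathbb{R}^n} e^{-|y|^2 t}\,e^{ix\cdot y}\hat f(y)\,dy\; t^{\frac{\beta}{2}-1}\,dt. \tag{D.1.15} \]

For $f_b(x) := f(x + b)$, $b \in \mathbb{R}^n$, $\hat{f_b}(x) = e^{-ib\cdot x}\hat f(x)$ as in (iii) of Theorem 6.4, by
the translation invariance of $dy$:

\[ \hat{f_b}(x) = \int_{\mathbb{R}^n} e^{ix\cdot y}f(y + b)\,dy = \int_{\mathbb{R}^n} e^{ix\cdot(y-b)}f(y)\,dy = e^{-ib\cdot x}\hat f(x). \tag{D.1.16} \]

We can therefore write, for $g_t(y) := e^{-t|y|^2}$,

\[ \int_{\mathbb{R}^n} e^{-|y|^2 t}\,e^{ix\cdot y}\hat f(y)\,dy = \int_{\mathbb{R}^n} g_t(y)\,\hat{f_{-x}}(y)\,dy \overset{\text{by Theorem D.1(iii)}}{=} \int_{\mathbb{R}^n} \hat{g_t}(y)\,f_{-x}(y)\,dy \overset{\text{by Theorem D.1(ii)}}{=} \Big(\frac{\pi}{t}\Big)^{n/2}\int_{\mathbb{R}^n} e^{-\frac{|y|^2}{4t}}\,f_{-x}(y)\,dy, \tag{D.1.17} \]

which plugged into (D.1.15) gives

\[ c(\beta)\hat h(x) = \int_0^\infty\int_{\mathbb{R}^n} e^{-\frac{|y|^2}{4t}}\,f(y - x)\,dy\,\Big(\frac{\pi}{t}\Big)^{n/2} t^{\frac{\beta}{2}-1}\,dt. \tag{D.1.18} \]

Here we wish to apply Fubini’s theorem a second time:

\[ \int_{\mathbb{R}^n}\int_0^\infty \Big| f(y-x)\,e^{-\frac{|y|^2}{4t}}\,t^{\frac{\beta}{2}-\frac{n}{2}} \Big|\,\frac{dt}{t}\,dy = \int_{\mathbb{R}^n} |f(y-x)| \int_0^\infty e^{-\frac{|y|^2 t}{4}}\,t^{\frac{n-\beta}{2}}\,\frac{dt}{t}\,dy \overset{\text{(D.1.11)}}{=} 2^{n-\beta}\,\Gamma\Big(\frac{n-\beta}{2}\Big)\int_{\mathbb{R}^n} \frac{|f(y-x)|}{|y|^{n-\beta}}\,dy, \tag{D.1.19} \]

where the first equality uses the invariance of $dt/t$ under $t \mapsto 1/t$.
On the other hand, $|y| = |y - x + x| \le |y - x| + |x| \implies$ (by definition (D.1.9))
$|f(y-x)| = f(y-x)$ (since $f > 0$) $= e^{-\alpha|y-x|} \le e^{\alpha|x|}e^{-\alpha|y|} \implies$
(by polar coordinates again)

\[ \int_{\mathbb{R}^n} \frac{|f(y-x)|}{|y|^{n-\beta}}\,dy \le e^{\alpha|x|}\int_{S_1(0)}\int_0^\infty \frac{e^{-\alpha r}\,r^{n-1}}{r^{n-\beta}}\,dr\,d\sigma = e^{\alpha|x|}\,\sigma_n\int_0^\infty e^{-\alpha r}\,r^{\beta-1}\,dr = e^{\alpha|x|}\,\sigma_n\,\frac{\Gamma(\beta)}{\alpha^{\beta}} < \infty \tag{D.1.20} \]

for $\alpha, \beta > 0$; cf. (D.1.11) again.
That is,

\[ \int_{\mathbb{R}^n} \frac{e^{-\alpha|y-x|}}{|y|^{n-\beta}}\,dy < \infty \quad\text{for } \alpha, \beta > 0, \tag{D.1.21} \]

which establishes the last sentence in the statement of Theorem D.2, and by
(D.1.19), (D.1.20) we can legitimately apply Fubini’s theorem:

\[ c(\beta)\hat h(x) = \pi^{n/2}\int_{\mathbb{R}^n} f(y-x)\int_0^\infty e^{-\frac{|y|^2}{4t}}\,t^{\frac{\beta}{2}-\frac{n}{2}-1}\,dt\,dy \overset{\text{by (D.1.19)}}{=} \pi^{n/2}\,2^{n-\beta}\,\Gamma\Big(\frac{n-\beta}{2}\Big)\int_{\mathbb{R}^n} \frac{f(y-x)}{|y|^{n-\beta}}\,dy, \tag{D.1.22} \]

which proves Theorem D.2.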
Corollary D.1. For $\alpha > 0$, $0 < \beta < n$,

\[ x \mapsto \int_{\mathbb{R}^n} \frac{e^{-\alpha|y-x|}}{|y|^{n-\beta}}\,dy \tag{D.1.23} \]

is a continuous function on $\mathbb{R}^n$ which vanishes at infinity.

Corollary D.1 follows since the function in (D.1.23) is the Fourier transform of
an integrable function on $\mathbb{R}^n$, namely the function $c(n, \beta)h$ for a suitable constant
$c(n, \beta)$.

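As

\[ \hat f(x) = \frac{\alpha c_n}{(\alpha^2 + |x|^2)^{\frac{n+1}{2}}} \tag{D.1.24} \]

(by Theorem D.1) we can write (by definition (D.1.9))

\[ h(y) = \frac{\alpha c_n}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}} \tag{D.1.25} \]

and then express Theorem D.2 by the equation

\[ \Gamma\Big(\frac{\beta}{2}\Big)\int_{\mathbb{R}^n} \frac{e^{ix\cdot y}\,\alpha c_n\,dy}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}} = d(\beta)\int_{\mathbb{R}^n} \frac{e^{-\alpha|y-x|}}{|y|^{n-\beta}}\,dy = d(\beta)\int_{\mathbb{R}^n} \frac{e^{-\alpha|y|}}{|y-x|^{n-\beta}}\,dy \tag{D.1.26} \]

for $d(\beta) := \pi^{n/2}\,2^{n-\beta}\,\Gamma\big(\frac{n-\beta}{2}\big)$, where the last equality is the change of variables $y \mapsto x - y$.
Multiply both sides of (D.1.26) by $e^{-\alpha|x|}$ and integrate
with respect to $x$:

\[ d(\beta)\int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{-\alpha(|x|+|y|)}}{|y-x|^{n-\beta}}\,dy\,dx = \alpha c_n\,\Gamma\Big(\frac{\beta}{2}\Big)\int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{ix\cdot y}\,e^{-\alpha|x|}}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}}\,dy\,dx, \tag{D.1.27} \]

where

\[ \int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \bigg| \frac{e^{ix\cdot y}\,e^{-\alpha|x|}}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}} \bigg|\,dx\,dy = \int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{-\alpha|x|}}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}}\,dx\,dy = \int_{\mathbb{R}^n} e^{-\alpha|x|}\,dx \int_{\mathbb{R}^n} \Phi(y)\,dy < \infty \tag{D.1.28} \]

by (D.1.4), for the choice $A = \alpha^2$ in (D.1.1). That is, by Fubini’s theorem,

\[ \begin{aligned} d(\beta)\int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{-\alpha(|x|+|y|)}}{|y-x|^{n-\beta}}\,dy\,dx &= \alpha c_n\,\Gamma\Big(\frac{\beta}{2}\Big)\int_{\mathbb{R}^n}\int_{\mathbb{R}^n} \frac{e^{ix\cdot y}\,e^{-\alpha|x|}}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}}\,dx\,dy \\ &\overset{\text{by def. (D.1.9)}}{=} \alpha c_n\,\Gamma\Big(\frac{\beta}{2}\Big)\int_{\mathbb{R}^n} \frac{\hat f(y)\,dy}{|y|^{\beta}(\alpha^2 + |y|^2)^{\frac{n+1}{2}}} \\ &\overset{\text{by (D.1.24)}}{=} \alpha^2 c_n^2\,\Gamma\Big(\frac{\beta}{2}\Big)\int_{\mathbb{R}^n} \frac{dy}{|y|^{\beta}(\alpha^2 + |y|^2)^{n+1}} \\ &\overset{\text{by polar coordinates}}{=} \alpha^2 c_n^2\,\Gamma\Big(\frac{\beta}{2}\Big)\,\sigma_n \int_0^\infty \frac{r^{n-1}\,dr}{r^{\beta}(\alpha^2 + r^2)^{n+1}} \\ &= \alpha^2 c_n^2\,\Gamma\Big(\frac{\beta}{2}\Big)\,\sigma_n\,\frac{(\alpha^2)^{-\frac{n+\beta+2}{2}}}{2}\,B\Big(\frac{n-\beta}{2}, \frac{n+\beta+2}{2}\Big), \end{aligned} \tag{D.1.29} \]

by the remark following (D.1.3). As $B(x,y) = \Gamma(x)\Gamma(y)/\Gamma(x + y)$ for $x, y > 0$
we get by definition of $d(\beta)$, $c_n$, and $\sigma_n$, and by (D.1.29) (see (D.1.2), (D.1.6))

\[ \pi^{n/2}\,2^{n-\beta}\,\Gamma\Big(\frac{n-\beta}{2}\Big)\,I_n = \frac{\alpha^{-n-\beta}\,2^{2n}\pi^{n-1}\,\Gamma\big(\frac{n+1}{2}\big)^2\,\Gamma\big(\frac{\beta}{2}\big)\,2\pi^{n/2}\,\Gamma\big(\frac{n-\beta}{2}\big)\,\Gamma\big(\frac{n+\beta+2}{2}\big)}{2\,\Gamma\big(\frac{n}{2}\big)\,\Gamma(n+1)} \tag{D.1.30} \]

or

\[ I_n = \frac{2^{\beta+n}\,\pi^{n-1}\,\Gamma\big(\frac{\beta}{2}\big)\,\Gamma\big(\frac{n+1}{2}\big)^2\,\Gamma\big(\frac{n+\beta+2}{2}\big)}{\alpha^{n+\beta}\,\Gamma\big(\frac{n}{2}\big)\,n!}. \]

Finally as $\Gamma(z + 1) = z\Gamma(z)$ we can write $\Gamma\big(\frac{n+\beta+2}{2}\big) = \frac{n+\beta}{2}\,\Gamma\big(\frac{n+\beta}{2}\big)$ and conclude that

\[ I_n = \frac{2^{n+\beta-1}\,\pi^{n-1}\,(n + \beta)\,\Gamma\big(\frac{\beta}{2}\big)\,\Gamma\big(\frac{n+1}{2}\big)^2\,\Gamma\big(\frac{n+\beta}{2}\big)}{\alpha^{n+\beta}\,\Gamma\big(\frac{n}{2}\big)\,n!} \tag{D.1.31} \]

for $\alpha > 0$, $0 < \beta < n$, which proves Theorem 10.1 in Appendix 10A.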
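As a sanity check on the final formula, the closed form can be evaluated directly. The sketch below assumes the closed form $I_n = 2^{n+\beta-1}\pi^{n-1}(n+\beta)\Gamma(\beta/2)\Gamma((n+1)/2)^2\Gamma((n+\beta)/2) / (\alpha^{n+\beta}\Gamma(n/2)\,n!)$; the function name is our own. For the case needed in Chapter 10, $n = 3$ and $\beta = 2$, it reduces to $20\pi^2/\alpha^5$, the familiar value of the Coulomb-type integral $\int\!\!\int e^{-\alpha(|x|+|y|)}/|x-y|\,dx\,dy$ on $\mathbb{R}^3 \times \mathbb{R}^3$.

```python
import math

def integral_closed_form(n, beta, alpha):
    """Closed-form value of I_n for alpha > 0 and 0 < beta < n (n a positive integer).

    Hypothetical helper: it evaluates the gamma-function expression quoted in
    the lead-in and does not perform any numerical integration.
    """
    if not (alpha > 0 and 0 < beta < n):
        raise ValueError("need alpha > 0 and 0 < beta < n")
    numerator = (
        2 ** (n + beta - 1)
        * math.pi ** (n - 1)
        * (n + beta)
        * math.gamma(beta / 2)
        * math.gamma((n + 1) / 2) ** 2
        * math.gamma((n + beta) / 2)
    )
    denominator = alpha ** (n + beta) * math.gamma(n / 2) * math.factorial(n)
    return numerator / denominator

if __name__ == "__main__":
    alpha = 1.0
    print(integral_closed_form(3, 2, alpha))   # case n = 3, beta = 2
    print(20 * math.pi ** 2 / alpha ** 5)      # expected value for that case
```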




Appendix E 

Some Informal Comments on QFT 



Remarks in the Preface and at the beginning of Chapter 11 point to the contrasting
nature of Parts I and II. As the latter part involves more technicalities, in some 
sense, with less exposition, and some casual mention of quantum fields, we offer 
here a few informal comments which might serve to ease the progression, at least 
slightly, to those latter chapters. 

The wave function is a central notion in the quantum mechanical description 
of a single particle (or multi-particle) system. The integral of its modulus squared 
over a region is interpreted as the probability of finding the particle in that region. 
We have also considered Feynman’s “sum over histories,” path integral approach 
to quantum mechanics and its probabilistic meaning. In the discussion of Heisen- 
berg’s uncertainty principle in Chapter 6, the decline of determinism with the rise 
of quantum mechanics was noted: the behavior of particles at the fundamental 
level is unpredictable and probabilistic. 

In quantum field theory (QFT), the basic entities are fields rather than particles.
As we have not attempted to define what a particle is, we will not define a
field. Particles and fields in fact are not necessarily mathematical concepts, but
are physical notions. A field, generally speaking, can be thought of, or represented
by, some function $\phi$ on some space-time (or some section of a bundle
over some space-time). The dynamics of the field is determined by some action
functional $S(\phi)$ given by integration of some Lagrangian density $L$ involving $\phi$
and its derivatives. A variational principle yields the equation of motion of the
field, similarly as one obtains the Euler-Lagrange equation of motion of a particle:
One sets the variation of $S(\phi)$ with respect to $\phi$ (a Fréchet derivative) equal
to zero; compare the remarks following equation (2.7.20). Solutions of these field
equations can represent particles, that is, quantized oscillations of the field, where various
properties of these particles correspond to various modes of oscillation. The
photons discussed in Chapter 1, for example, correspond to oscillations of the
electromagnetic field, whereas the Higgs particle (or Higgs boson, assuming it
exists) corresponds to oscillation of a Higgs field (after Peter Higgs). Recall from
Chapter 10 that bosons are particles with symmetric wave functions.

The simplest quantum field equation is the Klein-Gordon equation

\[ (\Box + m^2)\phi = 0 \tag{E.1.1} \]

for a scalar field $\phi$ on Minkowski space-time $\mathbb{R}^4$, where $\Box$ is the Laplace-Beltrami
operator of the Lorentzian metric

\[ g = \begin{pmatrix} 1 & & & 0 \\ & -1 & & \\ & & -1 & \\ 0 & & & -1 \end{pmatrix} \tag{E.1.2} \]

on $\mathbb{R}^4$. If one labels the coordinates on $\mathbb{R}^4$ by $(x_0, x_1, x_2, x_3) = (t, x, y, z)$ (where
$t$ represents time), then by definition 16.1.4 one immediately computes that

\[ \Box = \frac{\partial^2}{\partial t^2} - \Delta \tag{E.1.3} \]

for

\[ \Delta = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}. \tag{E.1.4} \]

Equation (E.1.1) is therefore the equation

\[ \frac{\partial^2\phi}{\partial t^2} - \Delta\phi + m^2\phi = 0. \tag{E.1.5} \]

An appropriate Lagrangian density whose variation of the corresponding action
with respect to $\phi$ yields equation (E.1.5) is

\[ L(\phi) = \frac{1}{2}\sum_{j=0}^{3} \partial_j\phi\,\partial^j\phi - \frac{m^2}{2}\phi^2, \tag{E.1.6} \]

where $\partial_j = \frac{\partial}{\partial x_j}$, $\partial^j = \sum_{k} g^{jk}\partial_k$.

The parameter $m$ in (E.1.1) can be thought of as the mass of some particle
but, strictly speaking, only after the field $\phi$ has been quantized so that some
particle interpretation of it can be considered. This quantization process, called
second quantization (better terminology for this could have been invented), involves
regarding $\phi$ as an operator subject to commutation rules analogous to the
Heisenberg-Born-Jordan rules considered earlier, where it is necessary, first of
all, to construct a field momentum operator conjugate to $\phi$. The latter construction
is done via functional differentiation of $L$ with respect to the time derivative of
$\phi$, which is the analogue of definition (2.7.13).

The Klein-Gordon equation was introduced originally as the analogue of Schrödinger’s
equation for the quantum mechanical study of relativistic particles. However,
in this attempt two drawbacks emerge. One of them is that one loses the
probabilistic meaning of the integral over a region of the modulus squared of a
solution $\phi$ (in contrast to the initial remarks of this appendix). Here we refer to
the relativistic version of the “modulus squared,” which is no longer necessarily
positive definite. A second drawback is that some $\phi$’s might yield negative relativistic
energy values. For example, fix $k = (k_1, k_2, k_3) \in \mathbb{R}^3$ and $m, \omega \in \mathbb{R}$. For
$v = (x_1, x_2, x_3) \in \mathbb{R}^3$ define $\phi$ on $\mathbb{R}^4$ by

\[ \phi(t, v) = e^{i(k\cdot v + \omega t)}. \tag{E.1.7} \]

Then for $|k|^2 = k\cdot k = k_1^2 + k_2^2 + k_3^2$,

\[ (\Box + m^2)\phi = (-\omega^2 + |k|^2 + m^2)\phi, \tag{E.1.8} \]

which means that $\phi$ is a solution of the Klein-Gordon equation (equation (E.1.5))
if and only if $\omega^2 = |k|^2 + m^2$. On the other hand, the energy of the plane wave
(E.1.7) turns out to be $-\omega = -(|k|^2 + m^2)^{1/2}$. One would need to find some
meaning of a negative energy particle. A proper interpretation of equation (E.1.1)
is that it describes the field distribution of certain spin-zero particles. Following
the consideration of the Klein-Gordon equation, for spin-0 fields, one may move
on to consider Dirac’s equation for spin-$\frac{1}{2}$ fields, Maxwell and Proca equations for
spin-1 fields, etc. We note that fields are characterized by a continuum of degrees
of freedom contrary to systems of particles.
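The plane-wave computation above can be written out in two lines. The sketch assumes a plane wave of the form $\phi(t,v) = e^{i(k\cdot v + \omega t)}$ with $\omega > 0$, the operator $\Box = \partial^2/\partial t^2 - \Delta$, and the energy operator $i\,\partial/\partial t$ in units with $\hbar = 1$; the last of these is our assumption about the convention intended in the text.

```latex
% Derivatives of the plane wave:
\frac{\partial^2\phi}{\partial t^2} = -\omega^2\phi,
\qquad
\Delta\phi = -|k|^2\phi
\quad\Longrightarrow\quad
(\Box + m^2)\phi = \big(-\omega^2 + |k|^2 + m^2\big)\phi .

% Energy of the plane wave (with \hbar = 1):
i\,\frac{\partial\phi}{\partial t} = -\omega\,\phi,
\qquad
-\omega = -\big(|k|^2 + m^2\big)^{1/2} < 0
\quad\text{when } \omega^2 = |k|^2 + m^2 .
```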

It is of interest to consider quantum fields at some finite temperature, a situation 
that arose in Chapter 16, and in particular to consider quantum partition functions. 

Recall from Chapter 14 the significance of such functions, and the Euclidean
path integral representation given in (14.1.2) by “integration on the diagonal”
of the density matrix $\rho$. Our remarks here provide a further perspective on
that chapter, which served in a crucial way as motivation for some of the more
difficult ideas of Chapter 16. In Theorem 14.3, for example, the logarithm of
the partition function (i.e., the free energy of a harmonic oscillator, up to a
constant) was expressed in terms of a suitable zeta function. That theorem helps
us to realize that in considerably more difficult situations, such as the
consideration of partition functions $Z$ associated to quantum fields over some
curved space-time, one should expect a zeta function to play a key role,
especially as such $Z$'s are also given by suitable path integrals. By this line
of thought one understands why we called definition (16.1.18) the main
definition of Chapter 16.

For physical systems consisting of a great number of interacting atoms, whose
states therefore cannot be specified exactly, one attempts to employ statistical
mechanical methods for their study. In a copper coin of mass 4 grams, for
example, there are about $4 \times 10^{22}$ atoms. Clearly it would be
impossible (even with a sophisticated computer) to keep track of the
interactions of these atoms. Instead of attempting to pin down the behavior of
individual particles in a complex system, the better procedure is to calculate
their average, or most probable, behavior. Thus the basic idea is that physical
properties of such a system are to be thought of as some kind of statistical
average which one calculates over an ensemble of considerably smaller states,
called microstates, of that system.
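
The order of magnitude quoted for the copper coin is easy to reproduce. The short Python sketch below assumes the standard atomic weight of copper (about 63.546 g/mol) and the SI value of Avogadro's number. Neither constant appears in the text, and the function name is hypothetical.

```python
AVOGADRO = 6.02214076e23   # atoms per mole (exact SI value)
MOLAR_MASS_CU = 63.546     # grams per mole, standard atomic weight of copper

def atoms_in_sample(mass_grams, molar_mass):
    """Number of atoms in a pure sample of the given mass."""
    return mass_grams / molar_mass * AVOGADRO

n_atoms = atoms_in_sample(4.0, MOLAR_MASS_CU)
print(f"{n_atoms:.2e}")  # roughly 4 x 10^22
```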

Consider a system at a given temperature T which interacts with some external 
heat bath or thermal reservoir. The energy of the total closed system consisting 
of the system plus the bath will have a constant value, although the energy of the 
system itself will vary as it assumes the various values $E_j$ of its
microstates. What is the probability $p_j$ of finding our system in a specific
microstate $S_j$ with energy value $E_j$? The answer is known to be given by the
equation

$$Z p_j = e^{-\beta E_j}, \tag{E.1.9}$$

where $\beta = \frac{1}{\kappa T}$ for a suitable constant $\kappa$
(Boltzmann’s constant, as in (14.1.1)), and where

$$Z = Z(T) = \sum_j e^{-\beta E_j} \tag{E.1.10}$$

is Boltzmann’s canonical partition function. The letter $Z$ is commonly used to
denote this function, $Z$ being the first letter of the German word
Zustandssumme (“sum over states”), the sum in (E.1.10) being taken over all of
the microstates $S_j$. From a given partition function $Z$ one derives the other
thermodynamic quantities. For example, the free energy of the system is given by

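$$F = F(T) = -\kappa T \log Z, \tag{E.1.11}$$

and the internal energy is given by

$$U = \kappa T^2 \frac{\partial}{\partial T} \log Z = \frac{\kappa T^2}{Z} \frac{\partial Z}{\partial T} = -\frac{1}{Z} \frac{\partial Z}{\partial \beta} = \sum_j p_j E_j. \tag{E.1.12}$$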



The last statement of equality follows from equation (E.1.9), and it shows that 
we can regard U as the mean energy of the system; compare this remark with 
equation (3.9.10) and compare equations (E.1.11) and (E.1.12) with equations 
(14.1.3) and (14.1.7). We also have the entropy 



$$S = -\frac{\partial F}{\partial T} = \kappa \frac{\partial}{\partial T}\left(T \log Z\right) = \frac{U}{T} + \kappa \log Z = \kappa \beta U + \kappa \log Z \tag{E.1.13}$$

(compare this with (14.1.5)), and we may conclude that

$$F = U - TS, \tag{E.1.14}$$

as in (14.1.6).
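
The relations among $Z$, $p_j$, $F$, $U$, and $S$ can be illustrated on a small toy system. The following Python sketch assumes units in which Boltzmann's constant equals 1 and uses a hypothetical three-level spectrum. The temperature derivatives are approximated by central differences, so the identities hold only up to a small numerical error.

```python
import math

KAPPA = 1.0  # Boltzmann's constant; units with kappa = 1 are an assumption

def partition_function(energies, T):
    """Z(T) = sum_j exp(-beta E_j), with beta = 1 / (kappa T)."""
    beta = 1.0 / (KAPPA * T)
    return sum(math.exp(-beta * E) for E in energies)

def probabilities(energies, T):
    """p_j = exp(-beta E_j) / Z."""
    beta = 1.0 / (KAPPA * T)
    Z = partition_function(energies, T)
    return [math.exp(-beta * E) / Z for E in energies]

def free_energy(energies, T):
    """F(T) = -kappa T log Z."""
    return -KAPPA * T * math.log(partition_function(energies, T))

def internal_energy(energies, T):
    """U = sum_j p_j E_j, the mean energy."""
    return sum(p * E for p, E in zip(probabilities(energies, T), energies))

def entropy(energies, T, h=1e-5):
    """S = -dF/dT, approximated by a central difference."""
    return -(free_energy(energies, T + h) - free_energy(energies, T - h)) / (2 * h)

def internal_energy_from_log_z(energies, T, h=1e-5):
    """U = kappa T^2 d(log Z)/dT, approximated by a central difference."""
    dlogz = (math.log(partition_function(energies, T + h))
             - math.log(partition_function(energies, T - h))) / (2 * h)
    return KAPPA * T**2 * dlogz

# Hypothetical energy levels and temperature, for illustration only.
energies = [0.0, 1.0, 2.5]
T = 0.8

probs = probabilities(energies, T)
F = free_energy(energies, T)
U = internal_energy(energies, T)
S = entropy(energies, T)
U_from_log_z = internal_energy_from_log_z(energies, T)

print(sum(probs), F, U - T * S, U, U_from_log_z)
```

The printed values show that the probabilities sum to 1, that $F$ agrees with $U - TS$, and that the two expressions for $U$ coincide, all within the finite-difference error.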

A quantum mechanical analogue of the Boltzmann partition function $Z$ in
(E.1.10) is the quantum partition function (at a fixed temperature)

$$Z = \sum_j e^{-\beta E_j}, \tag{E.1.15}$$

where now the sum is not over microstates but over eigenstates, the $E_j$ being
eigenvalues of a quantum Hamiltonian $H$ of a system, say
$H\psi_j = E_j \psi_j$ for a complete orthonormal basis $\{\psi_j\}$ of a
suitable Hilbert space. If we form the density operator $\rho = e^{-\beta H}$,
then we can regard $Z$ in (E.1.15) as its trace. Analogous to formula (E.1.9)
one has the formula

$$p_j = \frac{e^{-\beta E_j}}{Z} \tag{E.1.16}$$

for the probability $p_j$ that the system has energy $E_j$. It should be pointed
out that for many systems, where there are infinitely many degrees of freedom,
the density operator $\rho$ need not be of trace class. Of course $H$ also might
have continuous contributions to its spectrum. It is clear therefore that our
little excursion into
some of the preceding matters is by nature informal. The notion of “microstates,” 
for example, also requires a far more serious discussion. The reader may consult 
John von Neumann’s treatise [84] which gives careful attention to the thermody- 
namics of quantum mechanical ensembles, among other matters. Theorem 14.2 
shows that for a harmonic oscillator the Euclidean path integral representation of 
Z given in (14.1.2) agrees with the representation (E.1.15). 
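
For the harmonic oscillator, the sum (E.1.15) can be evaluated directly. The Python sketch below assumes units with $\hbar = 1$ and the standard oscillator spectrum $E_n = (n + \tfrac{1}{2})\omega$. It compares a truncated sum with the familiar closed form $1/(2\sinh(\beta\omega/2))$ obtained from the geometric series. The function names, the truncation length, and the sample values of $\beta$ and $\omega$ are illustrative assumptions, and the sketch does not reproduce the path integral argument of Chapter 14.

```python
import math

def oscillator_partition_sum(beta, omega, n_terms=200):
    """Truncated sum over eigenstates: sum_n exp(-beta * omega * (n + 1/2))."""
    return sum(math.exp(-beta * omega * (n + 0.5)) for n in range(n_terms))

def oscillator_partition_closed_form(beta, omega):
    """Geometric-series value of the same sum: 1 / (2 sinh(beta * omega / 2))."""
    return 1.0 / (2.0 * math.sinh(beta * omega / 2.0))

def level_probability(n, beta, omega):
    """p_n = exp(-beta E_n) / Z, the probability of finding energy E_n."""
    Z = oscillator_partition_closed_form(beta, omega)
    return math.exp(-beta * omega * (n + 0.5)) / Z

# Hypothetical sample values, for illustration only.
beta, omega = 1.3, 0.9

z_sum = oscillator_partition_sum(beta, omega)
z_closed = oscillator_partition_closed_form(beta, omega)
total_probability = sum(level_probability(n, beta, omega) for n in range(200))

print(z_sum, z_closed, total_probability)
```

Here the density operator is of trace class, since the terms decay geometrically. The caveat in the text concerns systems for which no such convergence is available.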

Boltzmann’s name has come up a couple of times. This Austrian physicist 
(Ludwig Boltzmann (1844-1906)) was absolutely convinced of the reality of 
atoms. His brilliant ideas were met with sharp criticism during his lifetime. De- 
spite the rejection of his molecular or atomistic kinetic theory of gases, for exam- 
ple, he continued to make outstanding discoveries that marked him as the creator 
of statistical thermodynamics. His formula $S = \kappa \log W$, which relates
the entropy $S$ and the probability $W$ of a given macroscopic state, was the
starting point for Planck’s quantum theory.

Work on Brownian motion by Einstein and Smoluchowski, and experiments 
of Jean Perrin, for example, eventually confirmed once and for all the existence 
of atoms. Boltzmann, however, never lived to see his work find vindication. The 
many attacks on him wounded him deeply to the point of suicide. Peace to his 
spirit. And to him and other great atomists, like Democritus and John Dalton, 
many salutations. 